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PLAG gene family and tumorigenesis 

The present invention relates to the identification of 

the PLAG gene family as a family of genes frequently 

f& c ^ . . . ^ . . Description 0+ Refo.4e<f tfrt 

5 associated with tumorigenesis y^The invention in particular 

relates to the identification of one member of this gene 

family that is involved in benign tumors such as those with 

chromosome anomalies involving a particular region of the 

long arm of chromosome 8, for instance but not restricted to 

10 pleomorphic adenomas of the salivary glands with involvement 
of chromosome 8ql2 and a translocation partner chromosome, 
very frequently chromosome 3 [i.e. t(3;8) (p2l;ql2) ] . in 
addition, the invention relates to another member of this 
novel family as the gene whose expression is frequently 

15 abrogated in malignant tumors such as for example but not 
restricted to malignant salivary gland tumors. Furthermore, 
the invention concerns the identification of the CTNNBl gene 
as a prototype of tumor-specific breakpoint region genes and 
frequent fusion partner of the PLAG genes. The invention 

20 relates in particular to the use of the members of the PLAG 
gene family and corresponding fusion partner genes as well 
as their derivatives in diagnosis and therapy. 

Multiple independent cytogenetic studies have firmly 
implicated chromosome region 8qll-l3 in the following tumor 

25 types: pleomorphic adenomas of the salivary glands (high 
frequency) , pleomorphic adenomas of the lacrimal glands 
(high frequency) , lipoblastomas (high frequency) , solitary 
lipomas (low frequency), rhabdomyosarcomas (low frequency), 
and renal cell carcinomas (low frequency) . As an 

30 illustration, we here focus on pleomorphic adenomas of the 
salivary gland. This type of tumor constitutes a benign 
epithelial tumor that originates from the major and minor 
salivary glands. It is the most common type of salivary 
gland tumor and accounts for almost 50% of all neoplasms in 

3 5 these organs; 8 5% of the tumors are found in the parotid 
gland, 10% in the minor salivary glands and 5% in the 
submandibular gland. About 50% of these adenomas appear to 
have a normal karyotype but cytogenetic studies have also 
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revealed recurrent specific chromosome anomalies. Frequently 
observed anomalies include aberrations of chromosome 8, 
usually involving the 8ql2 region with as most common 
aberration a t(3;8) (p21;ql2) , and aberrations of chromosome 
5 12, usually translocations involving region 12ql3-ql5. Non- 
recurrent clonal chromosome abnormalities have also been 
reported. The highly specific pattern of chromosome 
rearrangements with consistent breakpoints at 8ql2 and 
12ql3-ql5 suggests that these chromosomal regions harbour 
10 genes that might be implicated in the development of these 
tumors . 

In previous physical mapping studies, the chromosome 
8ql2 breakpoint in pleomorphic adenoma of the salivary 
glands was mapped between MPS and PENK . In an effort to 
15 molecularly clone the gene affected by the chromosome 8ql2 
aberrations in the various tumors, the present inventors 
chose directional chromosome walking as a structural appro- 
ach to define the DNA region encompassing these breakpoints. 
As a starting point for chromosome walking, the MPS 
20 locus was used. During these walking studies, YAC clones 

corresponding to eight different loci in 8qll-l2 were isola- 
ted and mapped by FISH analysis. Initially, the breakpoints 
were mapped within a 1 Mb region flanked by MPS proximal ly 
and by the genetic marker D8S166 distally. one YAC (CEPH 
25 166F4) within this region was shown to span the t(3;8) 

breakpoint in two tumors. Subsequently, it was shown by FISH 
that the chromosomal breakpoints as present in a number of 
primary pleomorphic adenomas were clustered within a small 
chromosomal segment which has been designated pleomorphic 
3 0 adenoma breakpoint cluster region (PA-BCR) . 

It has thus been found that essentially all breakpoints 
of chromosome 8ql2 map in a region indicated herein as PA- 
BCR. Further research revealed that in this, region a member 
of the PLAG gene family, the PLAGl gene, can be identified 
3 5 as the breakpoint target gene and postulated tumor aberrant 
growth gene. The PLAG gene family is a subclass of the zinc 
finger gene superfamily. The CTNNB1 gene was identified as 
the fusion partner gene of PLAGl . Molecular analysis of the 
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chromosome breakpoints revealed that the chromosome translo- 
cations resulted in gene fusions* Fusions appeared to occur 
in the 5 1 -non-coding regions of both genes (PLAG1 and CTNN— 
Bl) , exchanging regulatory control elements while preserving 
5 the coding sequences. Due to the t (3;8) (p21;ql2) , PLAG1 is 
activated and expression of CTNNB1 is down -regulated. Acti- 
vation of PIAG1 was also observed in adenomas with variant 
translocations, i.e. t(5;8) and t(8;15) . The results of the 
present inventors indicate that PLAG1 activation due to 

10 promoter swapping is a crucial event in salivary gland 
tumor igenesis. ~~ - 

The preferential fusion partner of PLAG1 is the CTNNB1 
gene, which encodes B-catenih, a cytoplasmic protein of 
about 88 kD. Its frequent involvemenjr-in pleomorphic adeno- 

15 mas point towards a critical role of the CTNNB1 gene. The 

major role of CTNNB1 may be to provide an active promoter in 
front of the PLAG1 gene. However, since the observed promo- 
ter swapping between PIAG1 and CTNNB1 leads to down-regula- 
tion of CTNNB1 expression, it may also lead to other physio- 

20 logical consequences, especially since B-catenin is a pro- 
tein that has been implicated in highly diverse processes, 
such as human colon cancer, epithelial cell adhesion, embry- 
onal axis formation in Xenoous . and pattern formation in 
Drosophila . 

2 5 According to the present invention the aberrations in 

the PLAG1 gene on chromosome 8 and the CTNNB1 gene on chro- 
mosome 3 have been used as a model to reveal the more gene- 
ral concept of the involvement of members of these gene 
families in tumor igenesis. Although both gene families are 
30 known per se, up till the present invention the correlation 
between these families and tumor inducing chromosome aberra- 
tions, like translocations, deletions, insertions and inver- 
sions, has not been anticipated. Furthermore, until now, it 
was not previously demonstrated that alterations in the 

3 5 physiological expression level of the members of the gene 

family are probably also implicated in tumor development. 
According to the invention, it was demonstrated that in 
normal adult cells the expression level of the PLAGl gene is 
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practically undetectable, whereas in aberrantly growing 
cells, the expression level is significantly increased. It 
appears that PLAGl is a oncogene. 

The research that led to the present invention 
5 indicates that it is probable that abberations in the PLAGl 
gene, either in its sequence and/or its expression, will 
usually lead to a benign tumor. Furthermore, preliminary 
research indicated that expression of PLAGl is increased in 
various types of malignant salivary gland tumors. 

10 Another member of the PLAG gene family is the PLAG2 

gene. The present inventors have identified this gene, 
determined its nucleotide sequence and predicted its amino 
acid sequence. It was found that PLAG 2 mapped to a region 
frequently deleted in malignant salivary gland tumors 

15 (6q24). Thus, contrary to PLAGl . PLAG 2 may be a 
tumorsuppressor gene. 

The present invention now provides for a tool to 
investigate these theories and may ultimately lead to a 
means for distinguishing between benign and malignant 

2 0 tumors. This knowledge can then lead to more efficient 

methods of treatment. Thus, the present invention now provi- 
des for the members of the gene families or derivatives 
thereof in isolated form and their use in diagnostic and 
therapeutic applications. Furthermore, the knowledge on the 

2 5 location and nucleotide sequence of the genes may be used to 

study their rearrangements or expression and to identify a 
possible increase or decrease in their expression level and 
the effects thereof on cell growth. Based on this informati- 
on diagnostic tests or therapeutic treatments may be desig- 

3 0 ned. 

In this application, " PLAG " will be used to indicate 
the involvement of these types of genes in various types of 
tumors, not necessarily restricted to pleomorphic adenomas 
of the salivary glands. The term refers to all members of 
3 5 the PLAG gene family involved in non-physiological 

proliferative growth, and in particular involved in benign 
or malignant tumors. Members of the PLAG gene family show 
homology between zinc finger domains that are typical for 
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the PLAG1 gene* However, according to the invention it was 
furthermore found that even breaks outside the actual genes 
but in the vicinity thereof might result in aberrant growth. 
The term " PIAG gene" is therefore also intended to include 
5 the immediate vicinity of the gene. The skilled person will 
understand that the "immediate vicinity" should be 
understood to include the surroundings of the gene in which 
the breaks or mutations will result in the above-defined 
non-physiological proliferative growth. 
10 The term "Tumorigenesis gene" or "T-gene" will be used 

to indicate all members of this novel PLAG gene family and 
their corresponding translocation or fusion partners, like 
CTNNB1. 

The term "wildtype cell" is used to indicate the cell 
15 not harbouring an aberrant chromosome. "Wildtype" or "nor- 
mal" chromosome refers to a non-aberrant chromosome. 

The present invention provides for various diagnostic 
and therapeutic applications that are based on the informa- 
tion that may be derived from the genes. This information 

2 0 not only encompasses its nucleotide sequence or the amino 

acod sequence of the gene product derived from the gene, but 
also involves the levels of transcription or translation of 
the gene. 

The invention is thus two-fold. On the one hand the 
25 aberration in cell growth may be directly or indirectly 

caused by the physical breaks that occur in the gene or its 
vicinity. On the other hand the aberration in cell growth 
may be caused by a non-physiological expression level of the 
gene. This non-physiological expression level may be caused 

3 0 by the break, or may be due to another stimulus that activa- 

tes or deactivate the gene. At present, the exact mechanism 
or origin of the aberrant cell growth is not yet completely 
unraveled. However, exact knowledge at this time is not 
necessary to define methods of diagnosis or treatment. 
3 5 Diagnostic methods according to the invention are thus 

based on the fact that an aberration in a chromosome results 
in a detectable alteration in the chromosomes' appearance or 
biochemical behaviour. A translocation, for example will 
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result in a first part of the chromosome (and consequently 
of a PLAG gene) having been substituted for another (second) 
part (further referred to as "first and second substitution 
parts"). The first part will often appear someplace else on 
5 another chromosome from which the second part originates. As 
a consequence hybrids will be formed between the remaining 
parts of both (or in cases of triple translocations, even 
more) chromosomes and the substitution parts provided by 
their translocation partners. Since it has now been found 

10 that the breaks occur in a PLAG gene, this will result in 
hybrid gene products of that PLAG gene. Markers, such as 
hybridising molecules like RNA, DNA or DNA/RNA hybrids, or 
antibodies will be able to detect such hybrids, both on the 
DNA level, and on the RNA or protein level. 

15 For example, the transcript of a hybrid will still 

comprise the region provided by the remaining part of the 
gene /chromosome but will miss the region provided by the 
substitution part that has been translocated. In the case of 
inversions, deletions and insertions the gene may be equally 

20 afflicted. 

Translocations are usually also cytogenetically detec- 
table. The other aberrations are more difficult to find 
because they are often not visible on a cytogenetical level. 
The invention now provides possibilities for diagnosing all 

2 5 these types of chromosomal aberrations. 

In translocations markers or probes based on the PLAG 
gene for the remaining and substitution parts of a chromoso- 
me in situ detect the remaining part on the original chromo- 
some but the substitution part on another, the translocation 

3 0 partner . 

In the case of inversions for example, two probes will 
hybridise at a specific distance in the wildtype gene. This 
distance might however change due to an inversion. In situ 
such inversion may thus be visualized by labeling a set of 
35 suitable probes with the same or different detectable mar- 
kers, such as fluorescent labels. Deletions and insertions 
may be detected in a similar manner. 
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According to the invention the above in situ applicati- 
ons can very advantageously be performed by using FISH 
techniques. The markers are e.g. two cosmids one of which 
comprises exon 1 and the upstream region of the PLAG gene, 
5 while the other comprises the last exon and its downstream 
region. Both cosmids are labeled with different fluorescent 
markers, e.g. blue and yellow. The. normal chromosome will 
show a combination of both labels, thus giving a green 
signal, while the translocation is visible as a blue signal 

10 on the remaining part of one chromosome (e.g. 8) while the 
yellow signal is found on another chromosome comprising the 
substitution part. In case the same labels are used for both 
probes, the intensity of the signal on the normal chromosome 
will be 100%, while the signal on the aberrant chromosomes 

15 is 50%. In the case of inversions one of the signals shifts 
from one place on the normal chromosome to another on the 
aberrant one. 

In the above applications a reference must be included 
for comparison. Usually only one of the two chromosomes is 

20 afflicted. It will thus be very convenient to use the normal 
chromosome as an internal reference. Furthermore it is 
important to select one of the markers on the remaining or 
unchanging part of the chromosome and the other on the 
substitution or inverted part. In the case of the PLAG1 gene 

25 of chromosome 8, breaks are usually found in the 5 1 -UTR as 
is shown by the present invention. Probes based on the 5 1 - 
and 3 f -UTR are thus very useful. As an alternative a combi- 
nation of probes based on both translocation or fusion 
partners may be used, e.g. 5 '-UTR of PLAG1 and 3»-UTR of 

30 CTNNB1. 

"Probes" as used herein should be widely interpreted 
and include but are not limited to linear DNA or RNA 
strands, Yeast Artificial Chromosomes (YACs) , or circular 
DNA forms, such as plasmids, phages, cosmids etc.. 
35 These in situ methods may be used on metaphase and 

interphase chromosomes. 

Besides the above described in s i tu methods various 
diagnostic techniques may be performed on a more biochemical 
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level, for example based on alterations in the DNA, rna or 
protein or on changes in the physiological expression level 
of the gene. 

Basis for the methods that are based on alterations in 
5 the chromosome's biochemical behavior is the fact that by 
choosing suitable probes, variations in the length or compo- 
sition in the gene, transcript or protein may be detected on 
a gel or blot. Variations in length are visible because the 
normal gene, transcript (s) or protein (s) will appear in 

10 another place on the gel or blot then the aberrant one(s) . 
In case of a translocation more than the normal number of 
spots will appear. 

Based on the above principles the invention thus rela- 
tes to a method of diagnosing cells having a non-physiologi- 

15 cal proliferative capacity, comprising the steps of taking a 
biopsy of the cells to be diagnosed, isolating a suitable 
PLAG gene-related macromolecule therefrom, and analysing the 
macromolecule thus obtained by comparison with a reference 
molecule originating from cells not showing a non-physiolo- 

20 gical proliferative capacity, preferably from the same 

individual. The PLAG gene-related macromolecule may thus be 
a DNA, an RNA or a protein. The PLAG gene may be either a 
member of the PLAGl family or of the translocation partner 
gene family of which the CTNNB1 gene is the prototype. 

25 in a specific embodiment the diagnostic method of the 

invention comprises the steps of taking a biopsy of the 
cells to be diagnosed, extracting total RNA thereof, prepa- 
ring a first strand cDNA of the mRNA species in the total 
RNA extract or poly-A-selected fraction (s) thereof, which 

30 cDNA comprises a suitable tail; performing a PCR using a 
PLAG gene-specific primer and a tail-specific primer in 
order to amplify PLAG gene-specific cDNA's; separating the 
PCR products on a gel to obtain a pattern of bands; evalua- 
ting the presence of aberrant bands by comparison to wildty- 

35 pe bands, preferably originating from the same individual. 
As an alternative amplification may be performed by 
means of the Nucleic Acid Sequence-Based Amplification 
(NASBA) technique [Compton, J. (1991) Nucleic acid sequence- 
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based amplification. Nature 350,91-92] or variations 
thereof . 

In another embodiment the method comprises the steps of 
taking a biopsy of the tumor to obtain cells to be 
5 diagnosed, isolating total protein therefrom, separating the 
total protein on a gel to obtain essentially individual 
bands, optionally transfer ing the bands to a Western blot, 
hybridising the bands thus obtained with antibodies directed 
against a part of the protein encoded by the remaining part 

10 of the PLAG gene and against a part of the protein encoded 
by the substitution part of the PLAG gene; visualising the 
antigen-antibody reactions and establishing the presence of 
aberrant bands by comparison with bands from wildtype 
proteins, preferably originating from the same individual. 

15 In a further embodiment the method comprises taking a 

biopsy of the tumor to obtain cells to be diagnosed; 
isolating total DNA therefrom; digesting the DNA with one or 
more so-called "rare cutter" (typically "6- or more 
cutters") restriction enzymes; separating the digest thus 

20 prepared on a gel to obtain a separation pattern; optionally 
transfering the separation pattern to a Southern blot; 
hybridising the separation pattern in the gel or on the blot 
with a set of probes under hybridising conditions; 
visualising the hybridisations and establishing the presence 

25 of aberrant bands by comparison to wildtype bands, 
preferably originating from the same individual. 

Changes in the expression level of the gene may be 
detected by measuring mRNA levels or protein levels by means 
of a suitable probe. 

30 Diagnostic methods based on abnormal expression levels 

of the gene may comprise the steps of taking a sample of the 
tumor to obtain cells to be diagnosed; isolating mRNA 
therefrom; and establishing the presence and/ or the 
(relative) quantity of mRNA transcribed from the PLAG gene 

3 5 of interest in comparison to a control. Establishing the 
presence or (relative) quantity of the mRNA may be achieved 
by amplifying at least part of the mRNA of the PLAG gene by 
means of RT-PCR or similar amplification techniques, in an 
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alternative embodiment the expression level may be 
established by determination of the presence or the amount 
of the gene product (e.g. protein) by means of for example 
monoclonal antibodies. 
5 The diagnostic methods of the invention may be used for 

diseases wherein cells having a non-physiological prolifera- 
tive capacity are selected from the group consisting of 
benign tumors, such as the tumors pleomorphic adenomas of 
the salivary glands, lipoblastomas, uterine leiomyomas, and 

10 other benign tumors as well as various malignant tumors, 

including but not limited to sarcomas (e.g. rhabdomyosarco- 
ma) and leukemias and lymphomas. 

As already indicated above, it has been found that in 
certain malignant tumors the expression level of the PIAG 

15 genes is increased. Another aspect of the invention thus 
relates to the implementation of the identification of the 
PLAG genes in therapy. The invention for example provides 
anti-sense molecules or expression inhibitors of the piag 
gene for use in the treatment of diseases involving cells 

2 0 having a non-physiological proliferative capacity by modula- 

ting the expression of the gene. 

The invention thus provides derivatives of the PIAG 
gene and/or its immediate environment for use in diagnosis 
and the preparation of therapeutical compositions, wherein 
25 the derivatives are selected from the group consisting of 
sense and anti-sense cDNA or fragments thereof, transcripts 
of the gene or fragments thereof, antisense RNA, triple 
helix inducing molecule or other types of "transcription 
clamps", fragments of the gene or its complementary strand, 

3 0 proteins encoded by the gene or fragments thereof, protein 

nucleic acids (PNA) , antibodies directed to the gene, the 
cDNA, the transcript, the protein or the fragments thereof, 
as well as antibody fragments. Besides the use of direct 
derivatives of the genes and their surroundings (flanking 
35 sequences) in diagnosis and therapy, other molecules, like 
expression inhibitors or expression enhancers, may be used 
for therapeutic treatment according to the invention. An 
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example of this type of molecules are ribozymes that destroy 
RNA molecules* 

Bsides the above-described therapeutic and diagnostic 
methods, the principles of the invention may also be used 
5 for producing a transgenic animal model for testing pharma- 
ceuticals for treatment of PLAG-related malignant or benign 
tumors. One of the examples describes the production of such 
an animal model* 

It is to be understood that the principles of the 

10 present invention are described herein for illustration 
purposes only with reference to the PIAG1 gene mapping at 
chromosome 8ql2 and the CTNNB1 gene on chromosome 3p21. 
Based on the information provided in this application the 
skilled person will be able to isolate and sequence corres- 

15 ponding genes of the gene family and apply the principles of 
this invention by using the gene and its sequence without 
departing from the scope of the general concept of this 
invention. 

The present invention will thus be further elucidated 
2 0 by the following examples which are in no way intended to 
limit the scope thereof. 

EXAMPLES 

2 5 EXAMPLE 1 

1 . Introduction 

This example describes the isolation and analysis of 
overlapping YAC clones and the establishment of a YAC contig 
(set of overlapping clones) , which spans genomic DNA around 

3 0 the MPS locus and includes the translocation breakpoints, 

t(3 ;8) (p21;ql2) , of pleomorphic salivary glands (Fig. 8). 
Pleomorphic adenomas are benign epithelial tumors 
originating from the major and minor salivary glands. Cyto- 
genetic analysis has indicated that these tumors mainly 
35 display chromosome breakpoints in region ql2 of chromosome 
8. In previous studies we have shown that these breakpoints 
are located in the 9 cM interval between MOS / D8S285 and 
D8S260. Here we report directional chromosome walking stu- 
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dies starting from D8S260 and D8S285, resulting in, respec- 
tively, a rudimental YAC contig of 5 Mb and a highly detai- 
led YAC contig and long-range physical map of about 2 Mb. 
The latter YAC contig has at least double coverage and 
5 consists of 34 overlapping YAC clones isolated from the CEPH 
and ICRF YAC libraries. Their insert sizes were estimated by 
contour-clamped homogeneous electric field (CHEF) gel elec- 
trophoresis. Chromosomal localization of the YACs was inves- 
tigated by FISH analysis. On the basis of YAC insert and 

10 insert-end derived DNA markers and (publicly known) sequence 
tag sites (STSs) , with an average spacing of 65 kbp, as well 
as restriction enzyme analysis, a long-range physical map 
was established for this centromeric region of chromosome 
8ql2. Within the YAC contig, the relative positions of 

15 various known genes and expressed sequence tag sites was 
determined. This YAC contig constitutes the basis for the 
construction of a transcriptional map of this region. FISH 
analysis using YACs and cosmids, corresponding to STSs in 
this YAC contig, already revealed that all but two of the 

20 chromosome 8ql2 breakpoints in the primary tumors that were 
tested, mapped within a 3 00 kb interval between the MOS 
proto-oncogene and STS EM156. 

2 . Materials and methods 
25 2.1. Pleomorphic adenomas of the salivary glands and cytoge- 
netic analysis. 

Primary pleomorphic adenomas of the salivary glands 
were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were established and 
30 analyzed as described before (RSijer et al., 1996) . The 

primary tumors with their aberrations used in this study are 
listed in Table 1. Slides for FISH were prepared from cells 
stored in fixative at -2 0°C. 

35 2.2. Fluorescence in situ hybridization (FISH) 

Slides for FISH were aged for 2-10 days, or were trea- 
ted with 2x SSC (pH 7.0) at 37 °C for 3 0 min, and subse- 
quently denatured in 70% formamide in 2x SSC (pH 7.0), at 
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75°C for 2-6 minutes • Cosmid and YAC DNAs (1 /xg) were labe- 
led with biotin-16-dUTP (Boehringer Mannheim) by nick trans 
lation. Labeled DNA was purified through a Sephadex G50 
column (Pharmacia), coprecipitated with 50 mg sonicated 
5 human placenta DNA (Sigma) , and dissolved in hybridization 
solution (50% formamide, 2x BSC pH 7.0, 50 mM sodium phosp- 
hate pH 7.0, 10% dextran sulfate). Hybridization and probe 
detection were carried out as previously described (Roijer 
et al., 1996). The alpha-satellite probe D8Z2 was obtained 

10 from Oncor. Slides were examined in a Zeiss Axiophot 

epif luorescence microscope using the appropriate filter 
combinations. Fluorescence signals were digitalized, 
enhanced and analyzed using the ProbeMaster FISH image 
analysis system (Perceptive Scientific Instruments, Houston 

15 Texas) . Color prints were produced using a Kodak XL 7700 
monochrome continuous printer. 

2.3 DNA preparations. 

Extraction of genomic BNA and the Southern analyses 
20 were performed as describe/ previously (Schoenmakers et al. 
1994). DNA from YACs, costfhids , PCR products and oligonucleo 
tides was labelled using/a variety of techniques. For FISH, 
cosmid clones or inter-^Alu PCR products of YACs were bioti- 
nylated with biotin-ll4-dUTP (Boehringer) by nick translati- 
2 5 on. YAC DNA (100 ng) Ls amplified by inter Alu PCR (Pi: 

CTGCACTCCAGCCTGGG, P*2 : TCCCAAAGTGCTGGGATTACAG ) . After initi 
al denaturation fo/ 5 min at 94 °C, 30 amplification cycles 
were performed ea^h consisting of denaturation for 1 min at 
94 °C, annealing ior 30 sec at 37 °C and extension for 6 min 
at 72 °C, and wx£h a final extension at 72 °C for 10 min. 
Amplified DNA yas purified with QIAQuick PCR Purification 
kit (Qiagen) . /For filter hybridizations, probes were 
radio-labelled with alpha- 32 P-dCTP using random hexamers 
(Feinberg arid Vogelstein, 1984) . In case of PCR-products 
35 smaller than 2 00 bp in size, a similar protocol was applied 
but specific oligonucleotides were used to prime labelling 
reaction^. Oligonucleotides were labelled using 
gamma - 32 Ep ATP . 
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2.4 YAC library screening. 

YAC clones in this paper were isolated from the CEPH 
mark 1 (Albert sen et al. , 1990) and CEPH mark 3 (Chumakov et 
al., 1992) YAC library, made available to us by the Centre 
5 d 1 Etude du Polymorphisme Humain (CEPH). YACs were isolated 
using a combination of PCR-based screening and colony hy- 
bridization analysis (Green and Olson, 1990) . Contaminating 
Candida parapsylosis, which was sometimes encountered, was 
eradicated by adding terbinafin to the growth medium (final 
10 concentration of 25 microgram/ml) . The isolated YAC clones 
were characterized by STS-content mapping, contour-clamped 
homogeneous electric field (CHEF) electrophoresis (Chu et 
al., 1986), restriction mapping and hybridization and FISH 
analysis. 

15 

2.5 Cosmid library screening. 

Cosmid clones were isolated from an arrayed human 
chromosome 8-specific cosmid library (Wood et al. , 1992) 
obtained from Los Alamos National Laboratory (LANL) . 
20 LANL-derived cosmid clones are indicated by their microtiter 
plate addresses. Cosmid DNA was extracted using standard 
techniques involving purification over Qiagen tips (Qiagen) . 

2.6 Pulsed-field gel electrophoresis and Southern blot 

2 5 analysis. 

Pulsed-field gel electrophoresis and Southern blot 
analysis were performed exactly as described by Schoenmakers 
et al. (1994) . Agarose plugs containing high-molecular 
weight YAC DNA (equivalent to about 1 x 108 yeast cells) 

3 0 were twice equilibrated in approximately 25 ml TE buffer (pH 

8.0) for 30 min at 50 °C followed by two similar rounds of 
equilibration at room temperature. Plugs were subsequently 
transferred to round-bottom 2 ml eppendorf tubes and equili- 
brated two times for 3 0 min in 500 microliter of the approp- 
3 5 riate 1 x restriction-buffer at the appropriate restriction 
temperature . 

Thereafter, DNA was digested in the plugs according to 
the suppliers (Boehringer) instructions for 4 h using 3 0 
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units of restriction endonuclease per digestion reaction. 
After digestion, plugs along with appropriate molecular 
weight markers were loaded onto a 1% agarose / 0.25 x TBE 
gel, sealed with LMP-agarose and size fractionated on a CHEF 
5 apparatus (Biorad) for 18 h at 6.0 V/cm using a pulse angle 
of 12 0 degrees and constant pulse times varying from 10 sec 
(separation up to 300 kbp) to 20 sec (separation up to 500 
kbp) . In the case of large restriction fragments, additional 
runs were performed, aiming at the separation of fragments 

10 with sizes above 500 kbp. 

Electrophoresis was performed at 14 °C in 0*25 x TBE. As 
molecular weight markers, lambda ladders (Promega) and 
home-made plugs containing lambda DNA cut with restriction 
endonuclease Hindlll were used. After electrophoresis, gels 

15 were stained with ethidium bromide, photographed, and UV 
irradiated using a stratalinker (Stratagene) set at 12 0 mJ. 
DNA was subsequently blotted onto Hybond N+ membranes 
(Amersham) for 4-16 h using 0.4 N NaOH as transfer buffer. 
After blotting, the membranes were dried for 15 roin at 80 °C, 

2 0 briefly neutralised in 2 x SSPE, and prehybridised for at 
least 3 h at 42 °C in 50 ml of a solution consisting of 50% 
formamide, 5 x SSPE, 5 x Denhardts, 0.1% SDS and 200 
microgram/ml heparin. Filters were subsequently hybridised 
for 16 h at 42 °C in 10 ml of a solution consisting of 50% 

25 formamide, 5 x SSPE, 1 x Denhardts, 0.1% SDS, 100 

microgram/ml heparin, 0.5% dextran sulphate and 2-3 x 106 
cpm/ml of labelled probe. Thereafter, membranes were first 
washed two times for 5 min in 2 x SSPE/0.1% SDS at room 
temperature, then for 30 min in 2 x SSPE/0.1% SDS at 42 °C 

30 and, finally, in 0.1 x SSPE/0.1% SDS for 20 min at 65°Co 
Kodak XAR-5 films were exposed at -80 °C for 3-16 h, 
depending on probe performance. Intensifying screens (Kyokko 
special 500) were used. 

Agarose plugs containing high-molecular weight yeast + 

35 YAC DNA (equivalent to 1 x 10 9 cells ml" 1 ) were prepared as 
described before (Schoenmakers et al., 1994). Plugs were 
thoroughly dialysed against four changes of 25 ml T10E1 (pH 
8.0) followed by two changes of 0.5 ml 1 x restriction 
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buffer before th y were subjected to either pulsed-field 
restriction enzyme mapping or YAC-end rescue. 

2.7 PCR analysis. 
5 PCR amplification was carried out using a Pharmacia 

LKB-Gene ATAQ Controller (Pharmacia/LKB) or a Perkin Elmer 
9600 (Perkin-Elmer Cetus) in final volumes of respectively 
50 and 25 microliter containing 10 mM Tris-HCl pH 8.3, 50 mM 
KC1, 1.5 mM MgCl 2 , 0.01% gelatine, 2 mM dNTPs, 2 0 pmole of 

10 each amplimer, 1.25 units of AmpliTaq (Perkin-Elmer Cetus), 
and 100 ng (for superpools) or 20 ng (for pools) of DNA. 
After initial denaturation for 5 min at 94 °C, 30 amplifica- 
tion cycles were performed each consisting of denaturation 
for 1 min at 94 °C, annealing for 1 min at the appropriate 

15 temperature (see Table 3), and extension for 1 min at 72 C C, 
and with a final extension at 72 °C for 10 min. Results were 
evaluated by analysis of 10 microliter of the reaction 
product on 2% agarose gels. 

2 0 2.8 Generation of STSs from YAC inserts and insert ends. 

YAC-end rescue was performed using a vectorette-PCR 
procedure in combination with direct solid phase DNA sequen- 
cing, as described before (Geurts et al., 1994). STSs speci- 
fic for YAC inserts were generated from inter-Alu PCR pro- 
25 ducts, isolated using published oligonucleotides TC65 or 517 
(Nelson et al., 1989) to which Sail-tails were added to 
facilitate cloning. Figure 10 shows some of these STSs. 
After sequence analysis, primer pairs were developed using 
the OLIGO computer algorithm (Rychlik, 1989) . They were 

3 0 tested on human genomic DNA, basically according to 

procedures described above. 

2.9 Nucleotide sequence analysis and oligonucleotides. 

Nucleotide sequences were determined according to the 
3 5 dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia/LKB or the dsDNA Cycle Sequen- 
cing System (GIBCO/BRL) . Sequencing results were analyzed 
using an A.L.F. DNA sequencer™ (Pharmacia Biotech) on 
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standard 3 0 cm, 6% Hydrolink , Long Range™ gels (AT Bio- 
chem) • Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI; Altschul et al., 1990). All 
oligonucleotides were purchased from Pharmacia Biotech. 

5 

Results 

3.1 Isolation of YAC clones to start chromosome walking. 
According to radiation hybrid mapping data, band 8ql2 

is flanked by markers D8S165, D8S285 proximally, and by 

10 D8S260 distally (Sapru et al. , 1994). The estimated genetic 
distance between these markers is 9 cM (Gyapay et al., 
1994). Using primer sets for these microsatellite markers, 
we selected sets of corresponding YAC clones. These YACs 
were used for initiating the walk towards the 8ql2 translo- 

15 cation breakpoints, meanwhile generating a long-range contig 
based upon STS-content mapping of these. Hence we started to 
construct a 8ql2 centromeric YAC contig (from D8S165 / 
D8S285) and a 8ql2 telomeric YAC contig (from D8S260) . By 
fluorescent in situ hybridisation (FISH) experiments, we 

2 0 could soon show that all but two of the translocation 

breakpoints in primary adenomas mapped within a 1 cM inter- 
val within the centromeric YAC contig (Roijer et al., 1996). 
Hence, the YAC contig of the telomeric side of 8ql2 was not 
characterized in that much detail. The two YAC contigs could 

25 not be linked to each other. 

3.2 Assembly of a YAC contig starting from D8S285 

In previous studies (Roijer et al., 1996) we have 
described the isolation of two overlapping YACs containing 

30 D8S285, namely 935E9 and 166F4. These YACs, together with a 
MOS containing YAC (900gl0157) , were used as a starting 
point for a chromosome walking project to define the 8ql2 
centromeric region, as mentioned in the introduction. Initi- 
ally, chromosome walking was performed bidirectionally . This 

35 allowed us to hook up our 8ql2 centromeric contig to a 
chromosome 8qll YAC contig. Both contigs have the marker 
D8S593 in common- In the subsequent PCR based screening of 
the human CEPH YAC, libraries with D8S108 and D8S166, we 
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isolated M gaYAC 946B7, which together with MegaYAC 93 5E9, 
formed the backbone of our centromeric contig. 

In the bidirectional and subsequent unidirectional 
chromosome walking steps, the following general procedures 
5 were used. First, rescuing and sequencing the ends of YAC 
clones resulted in DNA markers characterizing their left and 
right sides (Table 2). Based on sequence data on the ends of 
the YAC insert, as well as on inter-Alu PCR products, primer 
sets were developed for specific amplification of DNA, 

10 establishing STSs (Table 3). Their location to 8ql2-qter was 
determined by CASH as well as by FISH after corresponding 
YAC and/or cosmid clones were isolated. With the sequenti- 
ally selected and evaluated primer sets, screening of the 
YAC and cosmid libraries was performed to isolate the buil- 

15 ding blocks for contig assembly. Hence, we established a 

8ql2 centromeric YAC contig consisting of 34 overlapping YAC 
clones, covering approximately 2 Mb of DNA (Fig. 1). The 34 
YACs are between 160 kb and 1660 kb long. This YAC contig 
appeared to encompass the chromosome 8ql2 breakpoints of all 

2 0 but two primary adenomas studied. YAC 166F4 for instance, 

was already shown to cross the translocation breakpoint in 
two adenomas with t (3 ;8) (p21;ql2 ) (Roijer et al., 1996). 
Characteristics of the YACs that were used to build this 
contig are given in Table 2. 
25 The YAC contig was constructed in parallel with the 

screening data of Whitehead Institute / MIT Center for 
Genome Research available via http://www-genome.wi.mit.edu- 
/cgi-bin/contig/lookup_contig (Contigs WC8.7 and WC-157) . 

30 3.3 Physical mapping of polymorphic markers, ESTs and genes 
All markers and genes which were or became publicly 
available during our work were STS content mapped on our 
contig and those found positive were subsequently sublocali- 
zed by (primer) hybridization on YAC Southern blots. The 

3 5 markers that were found to reside within the centromeric 

contig presented here were D8S1069 (WI-536) , AFMB055WG9, 
D8S125, D8S108, D8S1661 ' (WI-5459 ) , D8S166, D8S96, D8S1816, 
D8S285, D8S1516 (WI-3862), D8S1828 and D8S165. Furthermore, 
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besides the MOS proto-oncogene (Testa et al., 1988), two 
other genes could be positioned in our centromeric YAC 
contig, namely the YES related proto-oncogene LYN (Corey and 
Shapiro, 1994) and the preproenkephalin gene PENK (Toinfohrde 
5 et al. , 1992). Finally, our contig contains 2 expressed 
sequence tags, namely WI-7754 (D8S1930) , corresponding to 
LYN, and IB2045, not resembling any currently known gene. 
The genes encoding the kappa opioid receptor (OPRK1) , 
(Yasuda et al., 1994) and interleukin-7 (IL7) (Brunton and 
10 Lupton, 1990) are not contained in our YAC contig. 

3.4 Restriction map and putative CpG islands: Construction 
of a rare-cutter physical map from the 2 Mb YAC contig 
in the 8ql2 centromeric region 

15 Southern blots of total yeast plus YAC DNA, digested to 

completion with rare-cutter enzymes BssHII, Kpnl , Mlul, 
NotI, Pvul, Sail and Sfil (see Materials and Methods) and 
separated on CHEF gels, were hybridized sequentially with 
(i) the STS used for the initial screening of the YAC in 

20 question, (ii) pYAC4 right arm sequences, (iii) pYAC4 left 
arm sequences, and (iv) a human Alu-repeat probe (BLUR-8) . 
The long-range restriction map that was obtained in this way 
was completed by probing with PCR-isolated STSs/YAC end 
probes. Restriction maps of individual YAC clones were 

25 aligned, and a consensus restriction map was established. 
The region was searched for CpG islands on the basis of the 
colocalization of sites for these rare-cutter enzymes (Fig. 
1) • 

30 3.5 FISH mapping of 8ql2 breakpoints in pleomorphic 
adenomas 

In previous studies we have mapped the t(3 ;8) (p21;q!2) 
breakpoint within a 1 Mb region flanked by MOS proximally 
and by the genetic marker D8S166 distally. One YAC within 
3 5 this region (166F4) was shown to span the t(3;8) breakpoint 
in two primary pleomorphic adenomas (Roijer et al. (1996)). 
We have now extended these previous observations by analysis 
of additional tumors including cases with different variant 
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translocations (Table 1) • Six of eight primary adenomas with 
8ql2 abnormalities had breakpoints mapping within YAC 166F4 
(Fig. 2A) . Also rumors with other 8ql2 rearrangements than 
the classical t(3;8) had breakpoints within the same YAC, 
5 demonstrating that this is the major breakpoint cluster 
region in pleomorphic adenomas with 8ql2 abnormalities. 

To fine map this region we isolated cosmids from an 
arrayed human chromosome 8-specific cosmid library (obtained 
from Los Alamos National Laboratory; Wood et al. (1992)) 

10 using markers contained within YAC 166F4. Cosmid clones 

containing MPS . EM156 and CH12 9 were isolated and mapped by 
FISH (Fig. 1). The breakpoints in all six cases were 
localized between cosmids CEM1 ( MPS ) en CEM23 (EM156) 
(Fig. 2B and C) . In for example CG 666, CEM1 was transposed 

15 to 8p due to a pericentric inversion of the dicentric marker 
chromosome (Fig. 2B) , while CEM2 3 remained in its normal 
position at 8ql2 (Fig. 2C) . In adenoma CG 682, which has an 
ins(8;3) (ql2 ;p21. 3pl4 . 1) , CEM23 spanned the insertion 
breakpoint (Roijer et al. (1997)). 

20 Two of the eight adenomas tested, CG 650 and CG 787, 

had breakpoints clearly proximal to MPS . To further map 
these breakpoints we isolated two additional sets of YACs 
corresponding to CH37, D8S593 and the recently identified 
XRCC7 gene (Blunt et al. (1995)), respectively. In CG 650 

2 5 the breakpoint was found to reside between ICRF YAC 

900gl0157 and YAC 898G12 (Fig. 1), and in CG 787 the 
breakpoint was found to reside within the XRCC7 containing 
YAC 943G4 (Fig. 2D). In the latter case, which judged from 
cytogenetics had. an ins(8;7) (qll-12 ;p21pl4) , FISH analysis 

3 0 revealed signals on both the der(7) and the der(8) 

chromosomes, demonstrating that this is not a simple 
8:7-insertion but a complex rearrangement with at least two 
breakpoints on 8q, one of which maps within YAC 94 3G4 and 
the other proximal to this YAC. 
35 3.6 Assembly of a YAC contig between D8S260 and D8S507 
Two MegaYACs corresponding to D8S260, were used to 
start the chromosome walk from the telomeric side of chromo- 
some 8ql2.The YAC insert-ends from these two clones (78 3H12 
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and 814A6) were isolated and used for STS content mapping. 
The most telomeric STS (CH31) , is present in two YACs also 
containing D8S510, and hence links the 8ql2 YAC contig to a 
8ql3 specific YAC contig (Koenig et al., 1995). When we 
5 started our physical mapping effort, only one polymorphic 
marker, D8S507, was described in between D8S285 and D8S260 
(Gyapay et al. , 1994) . Initially, we were unable to isolate 
YACs containing D8S507. In stead we used the corresponding 
amplimers to isolate a cosmid containing D8S507 (cosmid 

10 CEM3). An insert-ends of this cosmid clone (EM73) was then 
used to isolate corresponding MegaYACs (872E1, 882D9 and 
957C4) . It is unclear why these YACs were initially not 
recognised upon the YAC library screenings. This might have 
been due to either a less efficient PCR of D8S507 than of 

15 EM73, or to an underrepresentation of the D8S507 YAC-contai- 
ning yeast cells in the primary pools of the specific 
library we screened. The availability of two new polymorphic 
markers, D8S1505 and D8S1515 (WI-6879) allowed to link the 
two separate MegaYAC contigs around D8S260 and D8S507. The 

20 composite contig consists of 23 CEPH MegaYACs and covers 

approximately 5 Mb of genomic DNA (Fig, 3) . The polymorphic 
markers that were tested and found to reside in the telome- 
ric YAC contig are D8S1151 (WI-1155) , D8S260, D8S1075 
(WI-943) , D8S1505, D8S1515 (WI-6879), D8S1723, D8S507 and 

25 D8S1957 (WI-9507) . 

One gene, CYP7, encoding cholesterol 7a-Hydroxylase 
(and previously assigned to 8qll-l2 using both mouse-human 
somatic cell hybrids and FISH (Cohen et al., 1992)) was 
mapped in this contig using a primerset that we developed on 

30 publicly available sequence data (see Table 3). 

The YAC contig was constructed in parallel with the scree- 
ning data of Whitehead Institute / MIT Center for Genome 
Research available via http://www-genome.wi.mit.edu/ 
cgi-bin/ contig/ lookup_contig (Contigs WC8.8 and WC-72 7) . 

35 

4_2_ Discussion 

In this example the physical map of 2 Mb of the 
centromeric region of chromosome 8ql2 composed of 3 4 YAC 
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clones is presented. It is tagged by 31 markers with an 
average interval of 65 kb and was validated by FISH mapping. 
The frequency of chimaerism in the set of YACs studied in 
detail (YAC end rescue and/or restriction enzyme mapping 
5 and/or FISH) was slightly lower (6/26 = 23%) than the 30-40% 
expected (Van Omroen, 1993) . These data provide the most 
detailed available physical map of the 8ql2 region. The map 
was constructed in parallel with the establishment of a 
low-resolution map of the human genome and with the 

10 screening data of the Whitehead Institute/MIT Center for 
Genome Research. It was compared with data available 
electronically via Internet. The order of the markers on the 
physical map is in agreement with the 1996 Genethon human 
genetic linkage map (Dib et al. (1996)). In total, 

15 31 markers/genes were ordered on the contig as follows: 
D8S1069-CH274-TC65 . 2-AFMB055WG9-CH277-CH34-D8S125 — 
D8S108-EM76-D8S1661-CH122-D8S96-D8S166-CH69-CH3 3-D8S1816-PE- 
NK-CH280-CH129-EM156-END2-CH2 83-D8S285-MOS-D8S1516-LYN-D8S1- 
828-IB2045-D8S165-CH37-W1086. 

20 A restriction map with the enzymes BssHII, Kpnl, Mlul, 

NotI, Sail, Sfil, Swal was generated, revealing at least 7 
putative CpG islands. Three of these island correspond to 
previously known genes (PENK, MOS and LYN) , while the others 
might represent the 5' end of unidentified genes so far. Our 

25 2 Mb map integrates cytogenetic, genetic and physical data 
of chromosome band 8ql2. Chromosome band 8ql2 seems to be 
very gene-poor: so far only 4 genes were shown to reside in 
both contigs (CYP7, PENK, MOS and LYN). This is consistent 
with the view that Giemsa-positive bands are chromosome 

3 0 regions rather low in G+C content and poor in genes (Craig 
and Bickmore, 1993). Chromosome 8ql2 is partially conserved 
in synteny with mouse chromosome 4 (Mock et al., 1996). 
Comparative gene mapping however did not reveal any mouse 
gene or marker not present on our physical map. 

35 our physical map contains at least one disease locus, 

i.e. the pleomorphic adenoma translocation breakpoint. FISH 
analyses using YACs and cosmids revealed that the majority 
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of breakpoints clustered within a 3 00 kb region in both 
adenomas with the recurrent t(3;8) and in those with 
different sporadic 8ql2 translocations. A cosmid from this 
region was also shown to span the breakpoint in an adenoma 
5 with an ins(8;3) (Roijer et al. (1997)). This region is 

therefore likely to harbor the pleomorphic adenoma gene. The 
breakpoint region contains one known gene ( MPS ) , one 
polymorphic marker (D8S285) , and three new STSs (EM156, END 2 
and CH283). BLAST searches of these STSs revealed that CH283 

10 displayed sequence identity with a publicly available EST 
(expressed sequence tag) . 

Two of the eight tumors in the present series had 
breakpoints outside the 3 00 kb breakpoint region. By FISH we 
could demonstrate that the breakpoints in both cases were 

15 clearly proximal to MPS . In one case the breakpoint was 

mapped between MPS and YAC 898G12, and in the other case it 
was found to reside within YAC 94 3G4. The latter YAC maps at 
least 2 Mb centromeric to MPS . Since the two breakpoints are 
clearly different and also separate from those located in 

20 the 300 kb breakpoint region it is unlikely that they affect 
the same gene. However, we can of course not rule out the 
possibility that these cases have additional rearrangements 
that may have escaped detection with the DNA probes used, 
and that they in fact all have rearrangements of a common 

25 gene in the breakpoint region. 

Recently, we have obtained molecular evidence for an 
additional breakpoint region distal to our telomeric contig 
(Roijer et al. (in manuscript)). Based on FISH using YACs 
derived from 8ql3-2l we have estimated that these 

3 0 breakpoints are about 15 to 2 0 cM telomeric to the 8ql2 
breakpoint cluster region. Cytogenetically , these 
breakpoints are very difficult, or impossible to distinguish 
from the classical 8ql2 breakpoints. This is also true for 
the breakpoints located centromeric to MPS . Collectively, 

3 5 our findings thus suggest that there are several genes in 
the proximal part of 8q that are affected by chromosome 
rearrangements in pleomorphic adenomas. 
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In addition to pleomorphic adenomas there are also a 
few other types of solid tumor with structural 
rearrangements involving 8qll-l3, namely lipoblastomas (Dal 
Cin et al. (1994); Sawyer et al. (1994)), rhabdomyosarcomas 
5 (Mitelman (1994)) and renal cell carcinomas (Elfving 
(1996)). It should also be mentioned that pleomorphic 
adenomas of the lacrimal glands show recurrent 
rearrangements of 8ql2 (Hrynchak et al. (1994)), including a 
t(3;8) (p21;ql2) ♦ Because of the well known histopathological 
10 and clinical similarities between pleomorphic adenomas 
originating from the salivary and lacrimal glands it is 
likely that the 8ql2 abnormalities in these tumor types 
affect the same gene. As for the other tumor types mentioned 
it remains to be shown whether the rearrangements in these 
15 cases also affect this gene. Preliminary FISH analysis of 
lipoblastoma with an 8ql2 translocation breakpoint have 
shown that the breakpoint, as in pleomorphic adenomas, is 
distal to PK)S (unpublished observations) . Unfortunately no 
markers telomeric to MPS could be tested. FISH studies are 
20 now in progress to resolve these critical questions. 

Finally, a second YAC contig was established between 
D8S260 and D8S507, consisting of 23 CEPH MegaYACs and cove- 
ring approximately 5 Mb. Both contigs represent about 75% of 
human chromosomal band 8ql2 , which extends over approximate- 
25 ly 9 Mb of DNA. They will provide molecular access to new, 
previously unidentified genes from this region and especial- 
ly to those directly affected by the chromosome 8ql2 aberra- 
tions in pleomorphic adenomas of the salivary gland. 

30 Figure legends 

Figure 1 . 

Physical contig and STS marker map in proximal human 
8ql2. The upper line indicates the chromosome, with the 
3 5 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones are represented as lines together 
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with their names and are described in Table 2. STS locations 
are indicated by dashed lines. The length of the solid line 
indicates its approximative size. The size of YAC 3 91H2 and 
392A6 is too large to fit within the constraints of the map, 
5 suggesting chimaerism. In these cases, the lenght of the YAC 
corresponds only to the markers it contains. YAC end clones 
used as STS are represented by arrows. Below the contig the 
consensus restriction maps for BssHII (B) , Kpnl (K) , Mlul 
(M) , NotI (N) , Pvul (P) , Sail (S) and Sfil (Sf) are indica- 
10 ted. Verical arrows indicate putative CpG islands, defined 
as the colocalization of sites for (a) K , M, N; (b) K, M, P, 
S, Sf; (c) K, M, P, Sf; (d) K, N, S; (e) K, N, S; (f) B, N, 
S; (g) B, K, N, Sf. 

The pleomorphic adenoma gene region is indicated by a gray 
15 shaded box. 

Figure 2 . 

Physical contig and STS marker map in distal human 
8ql2. The upper line indicates the chromosome, with the 

20 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones and cosmid CEM3 are represented as 
lines together with their names. YAC end clones used as STS 

25 are represented by arrows. 

Figure 10 . 

STSs used to generate the 3 00 kb cosmid contig mapping 
at chromosome 8ql2 and encompassing PLAG1 • 

30 
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TABLE 2: 


Analysis of YAC clones in 


the centromeric contig 




CEPH code 


Size (kb\ 


Landm. left Acc. No. 


Landm. right Acc. No. 


Chimeric 


518G7 




CH131 




Y (left) 


518G12 




CH132 




ND 


770B2 


1410 


CH281 


CH286 


ND 


878F2 


800 


CH282 


CH287 


ND 


255H10 


390 


CH273 


CH272 


ND 


531D4 


430 


CH274 


CH277 


ND 


15H11 


430 






ND 


291pl2 


410 


CH29 


CH34 


Y (left) 


47E9 


370 






ND 


26F6 


460 






ND 


916F12 


1230 


CHI 22 


CH128 


ND 


388D6 


430 


CH69 


CH100 


ND 


297E11 


350 


CH280 


CH285 


ND 


253H7 


160 


CH279 


CH284 


ND 


143D5 


260 


CH278 


CH283 


ND 


391H2 


620 






Y 


392A6 


600 






Y 


84E12 


500 


CH289 


CH290 


ND 


166F4 


700 


CH67 


CH73 


ND 


164H5 


375 


END2 


END6 


ND 


206F11 




END3 


END7 


Y (right) 


253C7 


620 


END4 


END8 


ND 


295A4 




CH68 


CH74 


Y (left) 


917E11 


1380 


END1 


END5 


ND 


946B7 


1370 


CH123 


CHI 29 


ND 


935E9 


1660 


CH33 


CH37 


ND 



Note. YAC clones were isolated from CEPH YAC libraries as described under Materials and 
Methods. ND, not detected by methods used (FISH, YAC end rescue, restriction enzyme 
mapping). GenBank accession numbers are given. 
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TABLE 3: 8ql2 STS primer sequences, annealing temperatures and 
expected PCR product sizes 

Amplim ers on published sequences centromeric contig 



STS name 



Nucleotide sequence 5' - 3' 



Product T a (°C) 
size (bp) 



LYN 



bfos 



GGAAGGAAAG^GAAAGGAGA 
GGTTTGGGTGTTTGGTGTG 



GAGCTCAAGGTAGCAAGGCT 
AACTGTCqfcCAGTGCGG 



194 



203 



60 



60 



PENK 



TAATAAAGGAGCCAGCTATG 
ACATCTGATGTAAATGCAAGT 



75-83 



58 



D8S96 



D8S108 



D8S125 



TCTCTACCTCGA CATACTCCTGC 
GTCAGAAGTGCCAATCAGACTG 



CAAA'CCTTGAATTACAAAACAG 
TGrPAATATTAGACCACCTTTC 



TGTC 



113 



116 



TTOTGTTTCCTGGTCTCAGTAGC 144 
CACATGCATATGTGTATGTGTGTG 



60 



58 



62 



D8S165 



ACAAGAGCACATTTAGTCAG 

/gcttcatttttccctcta G 



138-152 



58 



D8S166 



jATTGTGTCATTGCACTCCA 
/ ACAAGGAAGTTCCTTTTTGG 



116 



58 



D8S285 



GCATCACACAGAATCTTTG 
ATGGGTTTATGGCCTTTAC 



108-124 60 



D8S1069 



AGCACAGTGGATATTTTTAGGC 
GGGGCTCACACAGAAGTTAA 



221 



60 
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GTCCCCCATCAACATGtTG 
CTCATCTTGTTTTCATAGTGTTCC 
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60 



D8S1661 



TCTCATGCATGTTTuCTGTTG 
GTTGGGGTCATTAA'ACACTAGTCA 



126 



60 



D8S1816 



D8S1828 



TGCACCCTTAAAAAGCATCG 
ACTTGCGAACATGGGATCAC 



AGTGCTGTTTTTACTTCTGTACG 
GCAAGACTCTGTCTCAGGA 



A^FMB055WG9 CTCCCAACCGACCCGAC 



143-147 



198-238 



218 



TGAAAACCATAATCTCTGATGTTGC 



60 



60 



60 



Amplimers on newly isolated/ STSs centromeric conti g 



STS name 



Nucleotide sequence 5' • 3' 



Product 

size (bp) 



T a (°C) 



CH33 



CCTTTGGCTGGGGTTTATA 
GGCCTATGAAGCAAGAGAG 



168 



60 



CH34 



CH37 



gtacccagaaggcaagtaa 
gtga!\aaggcagaaattag 



TTGCATGAGAATGGAAATG 
GGGGTTACTTTCCTTTTGT 



145 



176 



60 



58 



CH69 



AGTGCTTACAATAGGGTGAG 
CCATCCAGAAAGACCATAAT 



336 



60 



CH122 



TTJTGTCTTTGATTTTTATGG 
TGACCAACATACTGCCTA GT 



250 



57 



CH129 



citgaatcccagaacaatata 
a'gggtaagtatgtcctttaa 



110 



60 
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CH273 



ATAATGTTGAGACTTTGAGA 
AAATGTTTATCCTAATTGTA 



157 



58 



CH274 



CAGGTGAGTGGATGGTGTAA 
CAAGGGGAGACCAAATCATC 



241 



60 



CH277 



CH280 



EM76 



EM156 



AATGGCTATGAGpTTGTTTT 
CACATCCTTTCATTTTAGCA 



GGG CTGATGTyCC ATT A A CT 
GCTTCAACACCAAAAATGCT 



CTGGGAAGAGATCAAAATTC 
TAAAGAGAGAGCACCACAAA 



AGTAGCAGCAGCAACAGTCA 
TGCGCTAT/TCAGAGAAGATG 



122 



163 



220 



160 



58 



58 



60 



60 



EM216 



END1 



END2 



CAGTCAGTTCCAGAGGTCATTT 
TAGGGAGGGCTTTAATAGTGTT 



gctcagttcactcctaccc 
caacc/Zaccactaaaaacg 



GTGATrrTTACA GCCATTTT 
TGTAATTTTCAACCAGAAG 



255 



161 



91 



55? 



60 



50 



TC65.2 



TACAAACCGGGAGAAAACAG 
TTAckGCATTTCCGATTTTG 



232 



58 
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Amplimers on published sequences centromaric contig 



STS name 



Nucleotide sequence 5' •! 3' 




Product T a (°C) 
size (bp) 



CYP7 



TTGAC VVV 1 A AATATGATAGGTAT 1600 
ACT1T 1 ATTTCTGA AAGATGA ATCA 



55 



D8S260 



D8S507 



D8S1113 



D8S1505 



D8S1723 



AGGCTTGCCAGATJAAGGTTG 
GCTGAAGGCTGTTCTATGGA 



TTCCTCAGAGCA'GTTCAAAG 
TAATCTTGCCCCAGTGAGAG 



ATGCAAAGATGAACCAGGAA 
CCCTGGACTGATGGTACTTG 



AGCTCA ATG GC ACGTCCTTT 
ACTGCTGACTCAGAGCCTGG 



187-213 



155-169 



216 



GGATTTTAAGTTTCTACAAAGGGA 328 
ACAATTTCGATGAAGTGTTCACC 



235-243 



Amplimers on newly isolated STSs telomeric conti" 



STS name 



Nucleotide sequence 5' - 3' 



60 



60 



62 



58 



60 



Product T a (°C) 
size (bp) 



CH31 



TCACA'GAATAATACAGGAT 
GATCACTGATGATACTAGG 



220 



58 



CH32 



ATTTGCCTC AGTGTTGCA G 
ATTT/TCAGGAGGTCAGGGA 



245 



62 



CH35 



CAAAATGACTTATGCTGAA 
TCTA.TACAGGGCATTGTGA 



165 60 
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EM73 




351 



60 



EXAMPLE 2 

5 1 . I nt r oduc t ion 

In this example the identification of a member of the 
PLAG gene family that appears to be of pathogenetical rele- 
vance is described. In previous experiments, it was found 
that the segment between MPS and PENK on human chromosome 
10 8ql2 harbors the recurrent chromosome t(3;8) (p21;ql2) 

breakpoint frequently found in pleomorphic adenomas of the 
salivary glands. Using a positional cloning approach, the 
PLAG1 gene was identified and its genomic organization 
characterized . 

15 The largest cytogenetic subgroup of pleomorphic adenoma 

of the salivary glands carries chromosome 8ql2 aberrations 
with 3p21 as preferential translocation partner. Here we 
demonstrate that the t (3; 8) (p21;q!2) results in promoter 
swapping between PIAG1 . a novel, development ally regulated 

20 zinc finger gene on 8ql2, and the constitutively expressed 
gene for 6-catenin (CTNNB1) , a protein interface functioning 
in adherens junctions and the WG/WNT signalling pathway. 
Fusions occur in the 5 1 -non-coding parts of both genes, 
exchanging regulatory control elements while preserving the 

25 coding regions. Due to the t (3 ;8) (p21;q!2) , PIAG1 is activa- 
ted and expression of CTNNB1 down-regulated. Activation of 
PLAG1 was also observed in an adenoma with a variant trans- 
location t(8;15). The results indicate that PLAG1 activation 
due to promoter swapping is a crucial event in salivary 

30 gland tumorigenesis . 

Recently, molecular insight was obtained into the 
genetic basis of benign tumors by the discovery of a common 
genetic denominator in tumors of different types but with a 
particular cytogenetic profile (Schoenmakers et al. (1995); 

35 Ashar et al. (1995)). Molecular analysis of recurrent 

chromosome 12ql3-15 aberrations in pleomorphic adenoma of 
the salivary glands, lipoma, uterine leiomyoma, hamartoma of 
lung and breast, fibroadenoma of the breast, angiomyxoma, 
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and endometrial polyps revealed the consistent involvement 
of the high mobility group protein gene HMGIC; the first 
"benign oncogene" . 

Evidence is mounting that the same is likely to hold 
5 true for the closely related family member HMGIY, which is 
located on the short arm of chromosome 6. Both genes belong 
to the high mobility group (HMG) protein gene family (Bustin 
et al. (1990). Their corresponding proteins possess three 
amino-terminal AT hooks, through which the proteins are 

10 assumed to bind to A/T-rich DNA sequences, and an acidic 
tail in the carboxy-terminal region. Functionally, they act 
as architectural factors in the nuclear scaffold, critical 
for the correct assembly of stereospecif ic transcriptional 
complexes (Wolffe (1994) ) . 

15 The genetic aberrations frequently result in the 

separation of the three AT hooks from the acidic tail, 
although all the pivotal changes in the high mobility group 
protein genes that can lead to aberrant growth control are 
not yet known. It should be noted that a variety of chromo- 

20 some segments can act as translocation partner of 12ql3-15 
but each tumor type seems to have a preferential transloca- 
tion partner. At present, only the preferential t(3;12)(q27- 
28;ql5) translocation of ordinary lipoma has been 
characterized (Petit et al. (1996)). The gene affected on 

25 chromosome 3 is the LPP gene, which encodes a protein 

containing three LIM domains. LIM domains have been found in 
a growing number of proteins in mammals, amphibians, flies, 
worms, and plants and act as modular protein-binding 
interfaces (Schmeichel & Beckerle (1994)). LIM proteins 

3 0 mainly have a function in cell signalling and developmental 
regulation and include, for instance, transcription 
regulators, proto-oncogene products, and adhesion plaque 
constituents. The preferential t(3;12) in lipoma results in 
the formation of an HMGIC /LPP fusion transcript encoding a 

35 hybrid protein consisting of the three DNA binding domains 
of HMGI-C and the LIM domains of the protein encoded by LPP . 

In an approach to systematically unravel the molecular 
pathog nesis of benign tumors, the inventors have focused on 
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other cytogenetic subgroups of these, since their mere 
existence indicate the presence of additional critical 
genes. It is of interest to establish whether such genes are 
functionally similar or different as compared to the high 
5 mobility group protein genes HMGIC and HMGIY . In this con- 
text , the inventors have now molecular ly characterized the 
largest cytogenetic subgroup of pleomorphic adenomas. This 
type of neoplasm is an epithelial tumor occurring primarily 
in the major and minor salivary glands. It is by far the 

10 most common type of salivary gland tumor, accounting for 
almost 50 % of all tumors in these organs (Seifert et al. 
(1990)). Pleomorphic adenomas are almost exclusively benign 
tumors, which only rarely undergo malignant transformation. 
They show a marked histological diversity with epithelial 

15 and myoepithelial cells arranged in a variety of patterns in 
a matrix of mucoid, myxoid, chondroid and, on rare 
occasions, even osteoid tissues. Cytogenetically , 
pleomorphic adenomas are characterized by recurrent 
rearrangements, in particular reciprocal translocations with 

20 consistent breakpoints at 3p21, 8ql2 and 12ql3-l5 (Mitelman 
(1994)). In addition to the cases with abnormal karyotypes, 
there is also a subgroup of tumors with apparently normal 
karyotypes (30 to 50% of the cases) . Abnormalities of 8ql2 
are most common and are found in about 60% of the cases with 

25 abnormal stemlines, whereas rearrangements of 3p21 and 
I2ql3-15 are found in about 3 0% and 20% of the cases, 
respectively. The most frequent and characteristic 
abnormality so far observed in pleomorphic adenomas is a 
reciprocal t (3 ;8) (p21;ql2) translocation (Mark et al. 

30 (1980)). While chromosome 3p is the preferential 

translocation partner, most human chromosomes have been 
found as translocation partner of 8ql2. Rearrangements of 
8ql2 are often found as the sole anomalies, indicating that 
they represent primary cytogenetic events of possible 

35 pathogenetic importance. 

Here, the inventors describe the positional cloning of 
the 8ql2 translocation breakpoint as well as the identifica- 
tion and characterization of the genes on chromosome 8ql2 
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and 3p21 disrupted by the t(3;8) translocation . The gene on 
chromosome 8ql2 is a novel zinc finger gene, which we have 
designated PIAGl , whereas the gene on 3p21 is CTNNB1 which 
is the translocation partner, the gene for 6-catenin, a 
5 protein with an established role in cell adhesion and signal 
transduction (Peifer (1993) )/. By Southern blot and RACE 
analysis, we pinpoint the breakpoints within the introns of 
the 5'-noncoding regions of both genes and show that well- 
defined fusion transcripts are expressed in the tumor cells. 

10 We also demonstrate that the t(3;8) translocation results in 
specific disruption of the transcriptional control of the 
two genes, leading to activation of PIAGl and down- 
regulation of CTNNB1. A pleomorphic adenoma with 8ql2 but 
not 3p21 involvement has also been studied to evaluate the 

15 phenomenon of disruption of transcriptional control in a 
broader perspective. 

2_=_ Materials and methods 

2.1 Tumor material and chromosome analysis 

20 Primary pleomorphic adenomas of the salivary glands 

were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were made and analyzed 
as described by Roijer et al. (1996) . Primary pleomorphic 
adenomas, including CG368, CG580, CG644, CG752, CG753 and 

25 T9587, which all carry the recurrent t (3 ;8) (p21;ql2) as the 
sole anomaly, were selected for molecular analysis, as well 
as CG682, showing an ins (8 ; 3 ) (q!2 ;p21 . 3pl4 . 1) , and CG588 
which carries a t (8 ; 15) (ql2 ;ql4 ) . CG tumors are obtained 
from Department of Pathology, Goteborg University, 

3 0 Sahlgrenska University Hospital, S-413 4 5 Goteborg, Sweden. 
The tumor identified as T9587 is obtained from University 
Hospital Ghent, Belgium. FISH analysis was performed as 
previously described (Roijer et al. (1996)). Slides were 
examined in a Zeiss Axiophot epif luorescence microscope 

35 using the appropriate filter combinations. Fluorescence 
signals were digitalized, enhanced and analyzed using the 
ProbeMaster FISH image analysis system (Perceptive 
Scientific Instruments, Houston, Texas) . Color prints were 
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produced using a Kodak XL 77 00 monochrome continuous 
printer. 



2.2 Preparation and analysis of DNA and RNA 

5 Extraction of genomic DNA and Southern blot analysis 

were performed as described previously (Schoenmakers et al. 
(1994)). Total RNA was extracted from biopsy samples using 
the TRIZOL (Gibco/BRL) method and Northern analysis was 
performed according to standard procedures (Sambrook et al. 

10 (1989)). DNA from YACs, cosmids, PCR products, and 

oligonucleotides was labelled using a variety of techniques. 
For FISH, cosmid clones or inter-Alu PCR products of YACs 
were biotinylated with biotin-ll-dUTP (Boehringer) by nick 
translation. For filter hybridizations, probes were radio- 

15 labelled with a- 32 P-dCTP using random hexamers (Feinberg & 
Vogelstein (1984)). In case of PCR-products of T-genes of 
the invention smaller than 200 bp in size, a similar 
protocol was applied, but T-gene specific oligonucleotides 
were used to prime labelling reactions. Oligonucleotides 

2 0 were labelled using y- 32 P-ATP. 

2.3 YAC, cosmid, phage, and cDNA libraries 

YAC clones in this paper were isolated from the CEPH 
mark l YAC library (Albertsen et al. (1990)), using a 

25 combination of PCR-based screening and colony hybridization 
analysis (Green & Olson (1990)). YAC DNA was isolated and 
characterized as described before (Schoenmakers et 
al.(1995); Schoenmakers et al. (1994)). Cosmid clones were 
isolated from an arrayed human chromosome 8-specific cosmid 

30 library (Wood et al. (1992)) obtained from Los Alamos 

National Laboratory (LANL) . LANL-derived cosmid clones are 
indicated by their unique microtiter plate addresses. Phage 
clones were derived from a genomic library constructed with 
Li-14/SV40 DNA (Schoenmakers et al. (1994)) in XFIXII 

35 according to standard procedures. Cosmid and phage DNA was 
extracted using standard techniques involving purification 
over Qiagen tips (Qiagen) . Positive cDNA clones were 
identified in a mixed poly-dT/random-primed library in Agtli 
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constructed from human fetal kidney (Clontech) . Library 
screening was performed by plaque hybridization using 
various DNA probes, derived from subsequently isolated cDNA 
clones , according to the manufacturer's instructions. 

5 

2 . 4 DNA sequencing and computer analysis 

Nucleotide sequences were determined according to the 
dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia /LKB or the dsDNA Cycle Sequen- 
10 cing System (GIBCO/BRL) . Sequencing results were analyzed 
using an A.L.F. DNA sequencer 114 (Pharmacia Biotech) on 
standard 30 cm, 6% Hydrolink*, Long Range™ gels (AT Bio- 
chem) . Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI) . 

15 

2.5 PCR amplification of genomic DNA 

PCR amplifications were carried out essentially as 
described before ( Schoenmaker s (1994)). The following 
amplimers wefce used to generate a PLAG1 exon 1 probe (5'-CAA 

20 TGG CTG CTG/ GAA AGA GG-3 1 and 5 ' -CCC GTC CGC CGC CTC TAC 
ACC-3'), a/pLAGl ORF probe (5*-CGT AAG CGT GGT GAA ACC AAA 
C-3» and AGG GTC GTG TGT ATG GAG GTG A-3 1 ) , a PLAG1 3 • -UTR 
probe (5V-ACA TGG CAT TTC GTG TCA CT-3 ' and 5 • -CCA CAA TGG 
CTC TAG A/T-3') and a CTNNB1 exon 1 probe (5' -TGT GGC AGC AGC 

2 5 GTT GGC^CCG GC-3 • and 5«-CTC AGG GGA ACA GGC TCC TC-3 1 ) . 

2.6 Rapid amplification of cDNA ends (RACE) 

Rapid amplification of 3 1 cDNA-ends (3 • -RACE) was 
performed using af slight modification of part of the 
30 GIBCO/BRL 3 ■ -ET Protocol . For first strand cDNA synthesis, 
adapter primer ffAP2) AAG GAT CCG TCG ACA TC(T)17 was used. 
For both initial and secondary rounds of PCR, the universal 
amplification ptimer (UAP2) CUA CUA CUA CUA AAG GAT CCG TCG 
ACA TC was used as "reversed primer" • In the first PCR round 
35 the following /specif ic "forward primers" were used: i) 5 1 - 
CAA TGG CTG CTG GAA AGA GG-3 • (exon 1) or ii) 5 1 -AGA ATT TGG 
GCC TCA GAC AAG ATA- 3 1 (3 '-UTR, exon 5). In the second PCR 
round the following specific forward primers (nested primers 
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as compared to those 7 used in the first round) were used: i) 
5" -CAU CAU CAU CAU'GGC CGG AGG GAG GAT GTT AA-3 • (exon 1) or 



^ii) 5.-C 



10 



'-CAU CAU CAU CAU ATT GTC CTG GGT TGA TTA TGC AT- 3 1 

^ x - , exon 5). CUA/CAU-tailing of the nested, specific 

5 primers allowed the use of the directional CloneAmp cloning 
system (GIB^O/BRL) . 

For 5 1 -RACE experiments, the Marathon cDNA Amplificati- 
on kit (Clontech) wai used according the manufacturer's 
instructions with mi/nor modifications. The 5 • -untranslated 
end of the normal PLAG1 transcript as well as the chimeric 
transcripts were isolated by 5 » -RACE. First strand placenta 
or adenoma cDNA respectively, was synthesized from 5 jug 
total RNA using the MV2 primer (5«-CTG CAC TTG ACC CAC CCC 
TTG GAT- 3 1 ) located in exon 5. The ds cDNA was ligated to 
the adaptor and amplified using the anchor primer API and 
the MVS primer (B'-CAG GAG AAT GAG TAG CCA TGT GC-3') also 
located in exon 5. A second round of PCR was performed using 
the anchor primer and the MV6 primer (5«-TGC ACT TGT AGG GCC 
TCT CTC CTG-3*) located in exon 4. The final PCR products 
20 were purified out of agarose gel and cloned into the pCRIl 
vector (Invitroien) . 




2 . 7 RT-PCR 

Total RNA (5 fig) was reverse-transcribed using Super- 
25 script II reverse transcriptase (GIBCO BRL) and oligo d(T) 
primers according to the redbmmended conditions. 0.2 5 /ug of 
the resulting cDNA was subject to amplification using a 
variety of primer sets. Th4 amplification conditions for the 
CTNNB1 ^PLAGl fusion transcripts were 3 0 cycles at 94 °c for 
30 10 sec and 68 °C for 1 mi/n in a final volume of 50 jil using 
the Expand long template/ PCR system (Boehringer Mannheim). 
The first round PCR was/ carried out with the CTNNB1 primer 
5 '-TGT GGC AGC AGC GTJ GGC CCG-3 » (CAT-UP) and the PLAG1 
primer 5'-CAG GAG AAT/ GAG TAG CCA TGT GC-3' (MVS). The 
35 second round was performed on a 20 fold diluted sample with 
the CTNNB1 primer 5V-ACG GAG GAA GGT CTG AGG AGC AG-3 • 
(NECAT-UP) and the PLAG1 primer 5 • -TGC ACT TGT AGG GCC TCT 
CTC CTG-3 ' (MV6) . jo amplify the reciprocal PLAGl/CTNNBi 
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fusion transcript two rounds of PCR amplification were 
performed with 3 0 dycles at 94 °C for 3 0 sec, 63 °C for 3 0 
sec and 72 °C for L min in a final volume of 50 pi. The 
^ yytixBt round was okrried out with the PLAG1 primer 5*-CAA TGG 
CTG CTG GAA AGA GG-3 ' (START-UP) and the CTNNB1 primer 5'- 
AAG GAG CTG TGG /TAG TGG CAC-3 • (CAT3) • The second round was 
performed on a 20 fold diluted sample with the PLAG1 primer 
5'-GGC CGG AGG GAG GAT GTT AA-3 1 (START-RACE) and the CTNNB1 
primer S^GCC OCT TTT CTG TCT GGT TCC A-3 ' (CAT3NEST) • 

10 

3_s. Results 

3 . 1 The chromosome 8ql2 breakpoint cluster region in pleo- 
morphic adenoma 

As part of a positional cloning effort to isolate the 
15 gene (s) affected by the t (3 ; 8) (p21 ;ql2) in pleomorphic 

adenomas, we constructed a yeast artificial chromosome (YAC) 
contig consisting of 34 overlapping YACs, spanning about 2 
Mb within band 8ql2 (to be published elsewhere) . A long 
range STS and rare cutter physical map of this region was 
20 also developed. In previous experiments the t(3;8) break- 
point was mapped to a 1 Mb region flanked by MPS as proximal 
and by D8S166 as distal marker. One YAC within this region 
was shown to span the t(3;8) breakpoint in two tumors (CG588 
and CG644) . To further narrow down the breakpoint region we 

2 5 isolated cosmids corresponding to landmarks within this and 

other YACs in our contig (Fig. 3 A) . By FISH, we could 
demonstrate that the breakpoints in four out of four adeno- 
mas tested (CG580, CG588, CG644 and CG682) clustered within 
a 3 00 kb subregion located between MPS and STS CH12 9 (Fig. 

3 0 3A) . Two cosmids from this region, CEM2 3 and CEM48 , were 

shown to span the breakpoints in adenomas CG64 4 (Fig. 3B) 
and CG682 (data not shown), respectively. To refine the 
distribution of breakpoints within this 300 kb subregion, we 
developed a contig consisting of 29 phage and cosmid clones 
35 (Fig. 3A) . 



3.2 Identification and characterisation of the PLAG1 gene 
in the 8ql2 breakpoint region 
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.While initiating experiments to identify candidate 
genes mapping within the 300 kb genomic segment harbouring 
the adenoma breakpoints, BLAST searches (Altschul et al. 
(1990)) of newly obtained STSs mapping within the 300 kb 
5 contig revealed that the right end (CH283) of YAC clone 
143D5 displayed sequence identity with a publicly available 
EST (expressed sequence tag) (GenBank Accession number 
D59273). its position in the contig raised the possibility 
that this EST belonged to the candidate pleomorphic adenoma 
10 gene (£LAG1) , we were searching for. 

In initial Northern blot experiments using a PCR probe 
corresponding to this Particular YAC end, a 7.5 kb 
transcript was detected. A combination of sequential scree- 
nings of a human fetafl kidney cDNA library as well as 5«- 
15 and 3 • -RACE experiments led to the isolation of a composite 
cDNA of 7313 nucleotides (GenBank accession number U65002) . 
This cDNA contains /an open reading frame (ORF) of 1500 bp 
starting with the ^ATG at position 481-483 (Fig. 4A) . An in 
frame stop codon ^(TAG) is present 9 nucleotides upstream of 
20 this ATG. The deduced amino acid sequence reveals seven 
canonical C2H2 zinc finger domains (Fig. 4B) and a non- 
finger region of 259 amino acid residues representing the 
carboxy-terminufe of the deduced protein. The zinc finger 
motifs including the linker sequences are between 28 and 35 
amino acids long. The cysteine (C) and histidine (H) residu- 
es are presenf in their characteristic positions in each 
finger. The typical phenylalanine (F) and leucine (L) resi- 
dues in finder 4 and 6 are lacking. Strictly spoken, the 
deduced protein is not a Kruppel zinc finger protein, since 
30 it does not/ contain the characteristic H/C linker (consensus 
sequence TGEKPYK) in between the zinc fingers (Bellefroid et 
al. (1989)/) . only the seven amino acids between finger 1 and 
2 resemble the H/C linker (TGERPYK) . The amino-terminal 
region contains two nuclear localization signals (KRKR and 
3 5 KPRK) . The carboxy-terminus is serine-rich (45 amino acid 
residues/ out of 259, i.e 17%), raising the possibility of a 
regulatory role that may be controlled by serine/threonine 
kinas si 
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Outside the ORF, some interesting features in the cDNA 
can be observed. In the 3* untranslated region (UTR) , which 
is 533 3 bp long, a polyadenylation signal is present star- 
ting at position 7297, and there is also a TG-repeat, which 
5 might be of regulatory relevance. In addition, there is an 
ATTTA sequence motif, which has previously been shown to 
mediate mRNA destabilization in certain lymphokine and 
immediate early genes (Sachs (1993)). The ATTTA motif is 
repeated twelve times in the 3 9 -UTR. 

10 No sequence similarity was found to any of the known 

zinc finger proteins but preliminary studies have indicated 
that the human genome contains at least one additional gene 
that is closely related to our candidate pleomorphic adenoma 
gene. Nine anonymous ESTs (expressed sequence tags) with 

15 sequence similarity to sequences of our new gene were found 
in public databases and these ESTs could be aligned resul- 
ting in a contiguous sequence of about 1 kb. An open reading 
frame appeared to be present with coding potential for 
protein sequences with zinc-finger-like features. Sequence 

2 0 identity between the aligned ESTs and our new gene appeared 
to be about 75% at the amino acid sequence level; i.e. in 
the region of our gene encoding zinc fingers 4 to 7 . Of 
interest to note furthermore is that these anonymous ESTs 
sequences have been mapped to chromosome 6q2 4, which is 

2 5 implicated in tumors of the salivary glands. 

3.3 Genomic organization of the PLAG1 gene 

Comparison of transcribed and genomic DNA sequences of 
the PLAG1 gene revealed that it contains 5 exons (Fig. 3 A) . 

3 0 The transcriptional orientation of the gene is directed 

towards the centromere. The first three exons are noncoding, 
the fourth exon contains the translation start site (ATG) , 
and the amino-terminus of the protein including one complete 
zinc finger domain. The second finger is split by intron 4 
3 5 and continues into exon 5, which contains the remaining part 
of the ORF and the long 3»-UTR (5533 bp). Based on our YAC 
and cosmid maps, the PLAGl locus spans about 3 5 kb, with a 
large intron (approximately 25 kb) between exon 1 and exon 
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2. 3 4 -and 5 1 -RACE analysis revealed the existence of two 
alternatively spliced mRNAs which differ from each other by 
the presence or absence of the noncoding exon 2. The diffe- 
rence in electrophoretic mobility of the two mRNA isoforms 
5 is too small to distinguish them by Northern blot analysis. 
The functional relevance of this alternative splicing re- 
mains to be elucidated. 

3.4 Rearrangements of PLAG1 in primary pleomorphic adenoma 
10 In an attempt to establish whether the 8ql2 aberrations 

in primary pleomorphic adenomas with t (3 ;8) (p21;ql2) led to 
rearrangements in PLAGl, Southern blot experiments were 
performed using various PLAGl -derived probes. With a probe 
within the ORF (STS probe EM265) or the 3 , -UTR (KK64) of 

15 PLAGl, no rearrangements were detected. However, using 
probes corresponding to the 5» noncoding region of PLAGl , 
i.e. intron 1 (STS probes EM416 or EM317) or exon 2 (STS 
probe EM440) , rearrangements were detected in each of the 
t(3;8) tumors tested (data not shown). Similar Southern blot 

20 analysis of primary pleomorphic adenoma CG580, which carries 
a t(8;15) (ql2;ql4) , revealed also a rearrangement in the 5 1 
noncoding region of PIAGl (Fig. 5). These results seem to 
directly implicate PLAGl as a critical gene in pleomorphic 
adenoma tumor igenes is . Furthermore, the mechanism might 

25 involve transcriptional deregulation of PLAGl . since the 
rearrangements in PLAGl were invariably found in the 5 ■ 
noncoding region of PLAGl leaving its coding region intact. 

3.5 t(3;8) results in promoter swapping between PLAGl and 
3 0 the gene for B-catenin, CTNNB1 

The observation that the 8ql2 translocation breakpoints 
in various primary tumors were mapped to intron 1 or intron 
2 of the 5 1 -noncoding region of PLAGl raised the possibility 
that the t(3;8) results in the production of a chimeric 
3 5 transcript consisting of PLAGl sequences fused to those of a 
gene located on chromosome 3p21. To test this possibility, 
5 , -RACE experiments were performed using total RNA of the 
primary tumors CG644 and CG682, which both carry a 3, -8- 
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rearrangement. We used a primer specific for exon 5 (MV2) in 
the cDNA synthesis. Following ligation of an adaptor to the 
cDNA, PCR amplification was performed using , in the first 
round, an adaptor-specific primer and a PIAGl -specif ic 
5 primer corresponding to sequences of exon 5 (MVS) and; in a 
second round , nested adaptor-specific primer and PLAG1 - 
specific primer corresponding to sequences of exon 4 (MV6) . 
Nucleotide sequence analysis of the PCR product revealed 
that the ectopic sequences were fused to the acceptor splice 

10 site of exon 3 of PLAG1. BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNB1 (Nollet et al. 
(1996)), the gene for B-catenin, which has previously been 
assigned to chromosome 3p21 (Kraus et al. (1994)). 

Independent evidence for the involvement of CTNNB1 in 

15 the t (3; 8) -rearrangements in pleomorphic adenomas was obtai- 
ned by FISH using two YACs (756G5 and 750D3) , which contain 
the complete gene for B-catenin. According to published maps 
YAC 756G5 is contained within the proximal part of mega-YAC 
750D3 (Gemmill et al. (1995)). As expected, both YACs 

20 spanned the 3p21 breakpoint in adenoma CG682, with 

hybridization signals on the normal 3, the der(3) and the 
der(8) (data not shown). Detailed analysis of a few 
prometaphase cells enabled us to sublocalize the 6-catenin 
locus to band 3p21.3, which is in accordance with previous 

25 mapping data. 

To extend our observation to a larger number of tumors, 
we applied a reverse transcription-polymerase chain reaction 
(RT-PCR) approach. RT-PCR amplif ications using primers 
specific for CTNNB1 and PIAG1 were carried out with RNA from 

3 0 tumors CG368, CG588, CG644, CG752, CG753 and T9587 , which 
all carry the recurrent t(3;8), and from tumor CG682, which 
carries an ins(8;3) (ql2 ;p21. 3pl4 . 1) . RNA from tumor CG580, 
which carries a t(8;15), was included as a negative control, 
PCR experiments resulted in the generation of PCR products 

3 5 corresponding to hybrid transcripts consisting of PLAGl and 
CTNNB1 sequences, in seven out of seven t(3;8) tumors analy- 
zed (Fig. 6). In tumors CG368, CG588, CG682, CG752, and 
T9587, PCR products of 509 bp (Fig. 6A, PCR product A) and 
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614 bp (Fig. 6A, PCR product B) were generated, whereas in 
tumors CG644 and CG753, only the PCR product of 509 bp was 
found. The PCR product of 509 bp (from NECAT-UP up to MV6) 
is corresponds to a hybrid transcript containing exon 1 of 
5 CTNNBl and exons 3 to 5 of PLAG1 . The PCR product of 605 bp 
contains an extra 105 bp, which corresponds to the alterna- 
tively spliced exon 2 of PLAGl . it points towards the pre- 
sence of a related isoform consisting of exon 1 of CTNNBl 
and exons 2 to 5 of PLAGl. This was also confirmed by nucle- 

10 otide sequence analysis of the PCR products. We were also 
able to demonstrate that the corresponding reciprocal fusion 
transcripts are expressed. In all tumors except CG682, a PCR 
product of 130 bp was generated corresponding to a fusion 
transcript consisting of exon l of PLAGl and exons 2 to 16 

15 of CTNNBl . In tumor CG753, an additional PCR product was 

detected, corresponding to a fusion transcript consisting of 
exons 1 to 2 of PLAGl and exons 2 to 16 of CTNNBl . This 
additional band was also observed but with weak intensity in 
tumor CG644. All these results indicate that in CG644 and 

20 CG753, the 8ql2 translocation breakpoints are located in 
intron 2, whereas the breakpoints of tumors CG368, CG588, 
CG682, CG752, and T9587 are located in intron 1. Interestin- 
gly, using 5« RACE analysis with the tumor CG580, which 
carries a t(8;l5) translocation, we found that the break- 

25 point occurs also in the same region leading to a fusion 
transcript with ectopic fused to exon 3 of PLAGl . 

3.6 Activation of PLAGl expression due to promoter swapping 
Northern blot analysis was performed to evaluate the 

30 effect of the t(3;8) ( P 21;ql2) translocation on the expressi- 
on levels of PLAGl and CTNNBl (Fig. 7) . As previously menti- 
oned, PLAGl is expressed as a 7.5 kb transcript, which was 
readily detected in human fetal lung, liver, and kidney but 
not in fetal brain. In adult tissues, the 7.5 kb transcript 

35 was not detected in human heart, brain, lung, liver, skele- 
tal muscle, kidney, pancreas, and salivary gland. Low levels 
of PLAGl expression was found in human placenta (Fig. 7 A) . 
In contrast, the CTNNBl gene was ubiquitously expressed as a 



WO 98/07748 PCT/EP97/04759 

45 

3.8 kb RNA doublet in all tissues tested (Fig. 7B) . Similar 
results were obtained in the Northern evaluation of fetal 
and adult mouse tissues (data not shown) . 

Northern blot analysis was performed on two pleomorphic 
5 adenomas with t(3;8) and on one tumor with a variant t(8;l5) 
(Fig. 7C) . in contrast to normal salivary gland tissue in 
which no PLAGl transcripts could be detected, the three 
tumors expressed a 7.5 kb transcript. This transcript does 
not correspond to the normal PLAGl transcript (exons 1 to 

10 5) , since an exon 1-specif ic probe failed to hybridize to 
this transcript (data not shown) . Instead, a probe specific 
for exon l of CTNNBl recognized this 7.5 kb transcript as 
well as the 3.8 kb doublet of CTNNBl transcripts in the two 
tumors with a t(3;8). in contrast, only the 3.8 kb doublet 

15 of CTNNBl transcripts was detected in adenoma CG580 with a 
variant t(8;15). The expression level of CTNNBl in this 
tumor was about twofold higher as compared to tumors CG644 
and CG682, which is expected since none of the CTNNBl alle- 
les is rearranged. Collectively, these results suggest that 

20 piiAGl expression in these pleomorphic adenomas is driven by 
the 5 1 regulatory sequences either of the CTNNBl gene for 
the cases with t(3;8) or of an as yet unknown gene on chro- 
mosome 15 for the case with t(8;15). These results indicate 
that the principal mechanism in pleomorphic adenomas with 

25 8ql2 rearrangements is activation of PLAGl expression, due 
to promoter swapping between PLAGl and a translocation 
partner gene, preferentially CTNNBl . Furthermore, expression 
of the translocation partner gene is likely to be down- 
regulated concomitantly; e.g. as demonstrated for CTNNBl . 

30 

Discussion 

We have characterized a novel gene, PLAGl , which we 
propose is a critical locus involved in the development of 
pleomorphic adenoma of the salivary glands. The gene was 
35 identified on the long arm of chromosome 8, close to the MPS 
proto-oncogene, as a result of a positional cloning project 
to molecularly define the t (3 ;8) ( P 21;ql2) , which is the most 
frequent chromosome aberration in pleomorphic adenomas. We 
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have also identified the translocation partner gene at 3p21, 
i.e. the CTNNB1 gene which encodes B-catenin. In addition to 
8ql2 and 3p21, chromosome rearrangements involving 12ql3-l5 
are also frequently observed in pleomorphic adenomas. The 
5 high mobility group protein gene HMGIC was recently identi- 
fied as the gene consistently rearranged in these cases 
(Schoenmakers et al. (1995)). In conclusion, we have 
molecular ly defined the three main chromosomal 
rearrangements implicated in the genesis of pleomorphic 
10 adenomas. 

Structural rearrangement of the 8qll-l3 region are 
consistently found also in benign lipoblastomas. These 
tumors are considered to be unique variants of lipoma, 
occurring primarily in children before 3 years of age. Among 
15 the seven cases so far analyzed cytogenetically, rearrangem- 
ents involving 8qll-l3 were observed in six cases (Mitelman 
(1994)). Similar to pleomorphic adenomas, the translocation 
partners of 8ql2 are variable. The similarities in 
chromosomal pattern between these two tumor types suggest 

2 0 that PLAG1 is the target of the 8qli-i3 rearrangements also 

in lipoblastomas. 

We have also elucidated the molecular consequences of 
the t(3;8) translocation. The 8ql2 translocation breakpoints 
were invariably found in the 5 ' -non-coding region of PIAGI 
25 in all seven primary tumors tested. The breakpoints in the 
CTNNBl gene were always found in the first intron. As a 
result of the translocation, the coding sequences of PIAGI 
are brought under control of the 5« regulatory sequences of 
the CTNNB1 gene, and vice versa, resulting in the activation 

3 0 of PIAGI and down-regulation of CTNNBl. Based on their 

expression patterns in normal fetal and adult tissues, it is 
clear that the regulation of expression of the two genes is 
very different. The CTNNBl gene is highly and ubiquitously 
expressed, whereas PIAGI is development ally regulated, with 
3 5 expression restricted to fetal tissues. It should be empha- 
sized that in adult tissues, PIAGI is either not expressed 
or expressed at very low levels. In the adenomas, both the 
CTNNB 1 f PT AG 1 and the reciprocal PIAGI /CTNNBl transcripts 
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were detected by RT-PCR; detection of the latter transcripts 
sometimes required three rounds of PCR, indicating that the 
expression levels are very low. Our findings represent the 
first example of reciprocal exchange of expression control 
5 elements in solid tumors. Since the coding sequences of both 
genes are invariably preserved, the molecular mechanism 
could be classified as promoter swapping, resulting in 
activation of PIAGl and down-regulation of CTNNBl 
expression. 

10 Since the t(3;8) in pleomorphic adenomas invariably 

leads to activation of PIAGl . it is of interest to evaluate 
possible physiological implications. The PIAGl gene encodes 
a protein with a deduced molecular weight of 56 kDa and a 
deduced pi of 8.56. Analysis of the open reading frame of 

15 PIAGl reveals seven zinc fingers in the amino-terminal 

region. The carboxy-terminal region is rich in serine resi- 
dues. Furthermore, two potential nuclear localization sig- 
nals are present (residues r 2 2 -2 5 and 29-32). Collectively, 
this suggests that the PIAGl protein is a novel member of 

20 the large zinc finger gene family. Zinc finger motifs were 
originally identified as DNA binding structures in the RNA 
polymerase III transcription factor TFIIIA, which binds to 
the internal control region of the 5S RNA gene. TFIIIA-like 
zinc fingers are present in a variety of regulatory proteins 

25 found in higher and lower eukaryotes. This type of zinc 

finger motif consists of «3 0 amino acids with two cysteine 
and two histidine residues (C2H2) that stabilise the domain 
by tetrahedrally coordinating a Zn 2+ ion. A region of «12 
amino acids between the invariant cysteine-histidine pairs 

3 0 is characterized by scattered basic residues and several 
conserved hydrophobic residues. Most zinc finger genes 
studied so far encode polymerase II transcription factors. 
Apart from transcriptional modulation and control of RNA 
metabolism, chromatin packaging might also constitute an 

35 important activity through which zinc finger proteins exert 
their r gulatory roles (El-Baradi & Pieler (1991)). it 
should also be noted that the presumed role of HMGIC is also 
in chromatin modelling ( Schoenmakers et al. (1995) ; Ashar et 
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al. (1995); Wolffe (1994)). Th mammalian genome is known to 
con-tain a large numb r of C2H2 zinc finger genes (Bellefroid 
et al. (1989)) and the number of such genes implicated in 
cancer is growing steadily. 
5 The function of the serine rich carboxy-terminal part 

of PLAGI is unknown but it may have a regulatory function 
that can be controlled by serine/threonine kinases. If plagi 
encodes a DNA binding protein, as the presence of its zinc 
fingers suggests, the carboxy-terminal region might 
10 represent the transactivation domain. At least four 

different primary sequence motifs that characterize the 
activation domains are identified thus far, i.e. acidic, 
glutamine-rich, proline-rich and serine/threonine-rich. They 
likely represent regions that functionally interact with 
15 other proteins (Mitchell & Tjian (1989)). 

Activation of PLAGI due to promoter substitution may 
lead to deregulation of genes normally controlled by plagi . 
If over express ion of PLAGI is indeed a major molecular 
consequence of the 8ql2 translocations than this could 
20 possibly be achieved also by other mechanisms, including for 
instance an increase in gene copy number. In this context it 
should be noted that a small subgroup of pleomorphic adeno- 
mas show trisomy 8, often as the sole anomaly (Mitelman 
(1994)). In addition, trisomy 8 is a common numerical 
25 abnormality found in several histological subtypes of 
malignant salivary gland tumors as well as in different 
subtypes of leukemias and sarcomas (Mitelman (1994)). 

The preferential fusion partner of PLAGI is the CTNNB1 
gene, which encodes 6-catenin, a cytoplasmic protein of 
30 about 88 kDa (Gumbiner & McCrea (1996)). Its frequent 
involvement in pleomorphic adenomas as found here might 
point towards a critical role of CTNNB1. Most likely, a role 
of CTNNBJ. is to provide an active promoter in front of the 
PLAG1 gene. However, since the observed promoter swapping 
35 between PLAG and CTNNBl leads to down-regulation of CTNNB1 
expression, it may also lead to other physiological 
consequences, especially since 6-catenin is a broad-range 
protein interface, as that has been implicated in highly 
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diverse processes, such as human colon cancer, epithelial 
cell adhesion, embryonal axis formation in Xenopus r and 
pattern formation in Drosoohila (Peifer (1993)). The B- 
catenin protein has also been found as structural component 
5 of adherens junctions (AJs) (Peifer (1993); Kemler (1993); 
Gumbiner (1996)), which are multiprotein complexes assembled 
around Ca 2 *-regulated cell adhesion molecules (cadherins) . 
8-Catenin binds directly to cadherins and acts as a protein 
interface between cadherin and the cytoskeleton. The 

10 cadherin-B-catenin complex mediates cell adhesion, 

cytoskeletal anchoring, and signalling, which are important 
processes for regulation of cell growth and behaviour. It 
has been suggested that AJ complexes may play a role in the 
transmission of signals for contact inhibition (Peifer 

15 (1993)). Down-regulation of B-catenin, as occurs in 

pleomorphic adenomas, could interfere with this transmission 
and lead to growth deregulation. Since B-catenin levels 
apparently are important physiologically, it might be that 
down-regulation of CTNNB1 expression plays some role in 

20 pleomorphic adenoma development, possibly synergistically to 
PLAG1 . In conclusion, as activation of PLAG1 is observed in 
all pleomorphic adenomas with 8gl2 involvement tested, this 
seems to constitute a major factor in salivary gland tumor 
formation. The critical role of down-regulated levels of S- 

25 catenin remains to be established. 

Previous studies of HMGIC have shown that rearrange- 
ments of this gene preferentially occur in particular cyto- 
genetic subgroups of benign mesenchymal tumors, with the 
only exception so far being a subgroup of pleomorphic adeno- 

30 mas (Schoenmakers et al. (1995)). When considering the 

findings in the latter type of tumors, it should be noted 
that, although they are epithelial in origin, they often 
also show a varying degree of mesenchymal differentiation. 
This may be explained by the fact that pleomorphic adenomas 

35 are thought to originate from a pluripotent reserve cell in 
the terminal duct system which possesses the capacity to 
differentiate into both epithelial and myoepithelial cells 
(Evans & Cruickshank (1970)). The latter cells may function 
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as facultative mesenchymal cells, and are responsible for 
the production of extracellular material, including myxo- 
chondroid stroma (Dardick et al. (1991)), An intriguing 
question in this context is of course whether HMGIC is 
5 preferentially affected in adenomas with a prominent stromal 
component while PLAGl is preferentially affected in tumors 
with little or no stroma. Histological re-examination of all 
but one of the adenomas with PLAGl involvement in this 
study, revealed that all cases contained mesenchymal 

10 components, including both myxoid and chondroid tissues, 
suggesting that mesenchymal differentiation is not 
restricted to adenomas with HMGIC involvement. It should be 
noted that previous cytogenetic observations have already 
indicated that chromosome 12ql3-l5 and 8ql2 rearrangements 

15 constitute mutually exclusive pathways for the evolution of 
pleomorphic adenomas (Sandros et al. (1990)). 

The identification and cloning of a novel "benign 
oncogene" activated by chromosome translocations constitute 
a major step towards an increased understanding of the 

2 0 molecular pathogenesis of benign neoplastic growth. Our 

findings may lead to new insights into the genetic differen- 
ces between benign and malignant tumors . In addition, the 
present findings may have diagnostic implications. The fact 
that PLAGl is not normally expressed in adult tissues, but 

25 activated in pleomorphic adenomas due to 8ql2 rearrange- 
ments, makes it a potentially useful tumor marker in the 
differential diagnosis of benign and malignant salivary 
gland tumors. On the basis of the fusion transcripts that 
are formed, highly specific diagnostic assays can be develo- 

30 ped, as will be illustrated in the following examples. 

FIGURE LEGENDS 
Figure 3 

A: Contig of 3 overlapping YACs (bold lines) , 27 cos- 

3 5 mids and 2 phages, containing 2 7 landmarks and spanning a 

300 kb DNA region on chromosome 8ql2. Contig elements are 
labelled or numbered and defined in the list below. Cosmid 
clones isolated from the arrayed chromosome 8 -specific 
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cosmid Library constructed at Los Alamos National Laboratory 
(LANL) (Wood et al. (1992)) are named after their unique 
microtiter plate addresses. #1 and #22 are genomic phage 
clones; #21 is a clone isolated from the non-arrayed LANL 
5 chromosome 8-specific cosmid library. The orientation of the 
contig on the long arm of chromosome 8 is given as well as 
the order of 27 landmarks. It should be noted that the 
contig is not scaled. Below the contig, the genomic 
organization and the relative location of the PIAG1 gene is 

10 given schematically, with exact sizes (bp) of its exons and 
estimated sizes (kb) of its introns. Noncoding sequences are 
represented as open boxes and coding sequences as black 
boxes. The relative positions of the translation initiation 
(ATG) and stop (TAG) codons in PIAGl are indicated. At the 

15 bottom of the figure, characteristics of the deduced protein 
encoded by PLAGl are given. Zinc fingers are labelled F1-F7. 
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(clone number, cosmid or phage clone, STS, accession number) 

30 

B: Mapping of the 8ql2 translocation breakpoint in an 
adenoma (CG644) with a t (3;8) (p21;ql2) . Cosmid CEM48 (white 
spots) was co-hybridized with an alpha-satellite probes one 
specific for chromosome 3 and the other for chromosome 8 
35 (black spots) . Hybridization signals were found on the 
normal 8, the der (3), and the der (8), indicating that 
CEM48 spans the t (3 ;8) (p21 ;ql2) breakpoint. Chromosomes are 
stained in blue with DAPI. 

40 Figure 4 

A: cDNA and deduced amino acid sequence of PLAGl . The 
relative positions of exon/intron boundaries are indicated 
by triangles (t) . The conserved C, F, L and H residues in 
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the zinc finger domains are underlined. Residues 22-2 5 and 
29-32 constitute two potential nuclear localization signals. 
A potential polyadenylation signal and the putative mRNA 
destabilizing ATTTA motifs in the S'-noncoding region are 
5 underlined. The nucleotide sequence of the complete cDNA has 
been deposited at GenBank under accession number XXX. 

B: Alignment of the seven zinc-finger-like motifs found 
in the deduced amino acid sequence of PLAG1 relative to the 
C2H2 consensus motif. The canonical C, F, L and H residues 
10 are given in bold. 

Figure 5 

Illustrative example of the detection of DNA rearrange- 
ment in PLAG1 of a primary pleomorphic adenoma using Sout- 
15 hern blot analysis. DNA of pleomorphic adenoma CG368 (panel 

I) and control DNA isolated from normal lymphocytes (panel 

II) were digested with restriction endonuclease BamHI (B) , 
EcoRI (E), Hindlll (H) , or PstI (P) , as indicated. A probe 
with specificity for exon 3 of PLAG1 (EM440) was used. The 

20 molecular weight markers are 23.1, 9.4, 6.5, 4.3, 2.3 and 
2.0 kb, respectively. 

Figure 6 

A: Detection of CTNNB 1/PLAGi and PLAG1 / CTNNB i fusion 
25 transcripts by RT-PCR in primary adenomas. I. CTNNB 1/plagi 
were detected using the RT-PCR protocol and primers descri- 
bed in detail in Methods. Primary tumors analyzed included 
CG368 (lane 1), CG588 (lane 2), CG644 (lane 3), CG682 (lane 
4), CG752 (lane 5) , CG753 (lane 6), T9587 (lane 7), and 
30 CG580 (lane 8). II. PLAG1 / CTNNB 1 fusion transcripts were 
detected similarly using the same samples as under "I 1 . 
Details of the primers used here are given in Methods Secti- 
on. PCR products are labelled A-D. 

B: Schematic representation of the nature and origin of 
35 CTNNB 1 ^PIjAGI and PLAG1/ CTNNB i fusion transcripts in primary 
pleomorphic adenomas with t (3;8) (p21;ql2) . At the top of the 
figure, the exon/ intron distribution of the PIAG1 gene is 
given, at the bottom, the exon/ intron distribution for the 
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CTNNB1 gene* Positions of chromosome breakpoints are indica- 
ted by an arrow (N ) . Translation initiation sites are indi- 
cated by asterisks (*) and stop codons by triangles (t) . A- 
D: schematic exon compositions of hybrid transcripts, as 
5 established by 5«-RACE analysis. A: cDNA sequence junction 
between exon 1 of CTNNB1 and exons 3-5 of PLAGl . B: cDNA 
sequence junction between exon 1 of CTNNB1 and exons 2-5 of 
PLAGl. C: cDNA sequence junction between exon 1 of PLAG1 and 
exons 2-16 of CTNNB1 . D: cDNA sequence junction between 
10 exons 1-2 of PLAG1 and exons 2-16 of CTNNB1. 

Figure 7 

A: Northern blot analysis of the expression pattern of 
PLAGl in normal human fetal tissues including brain (1), 
15 lung (2), liver (3), kidney (4) as well as adult tissues 
including heart (5), brain (6), placenta (7), lung (8), 
liver (9), skeletal muscle (10), kidney (11), and pancreas 
(12). 

B: Northern blot analysis of the expression pattern of 
20 the CTNNB1 gene in normal human fetal and adult tissues as 
described under n A M • 

C: Detection of CTNNB1 /PLAGl transcripts in pleomorphic 
adenomas by Northern blot analysis using exon 1 of CTNNB1 as 
a molecular probe. Lane l, RNA of CG644 (t(3;8)), lane 2, 
25 RNA of CG580 (t(8;15)), and lane 3, RNA of CG682 (ins 3p21 
(8ql2)). The CTNNB1 /PLAGl fusion transcript is indicated. 

D: Using the same blot as under "C" , detection of 
CTNNB1/PIAG1 transcripts using a probe with specificity for 
the 3 f UTR of PLAGl (probe KK64) . The CTNNB1/ PLAGl trans- 
30 cript is indicated. 

EXAMPLE 3 

The discovery of PLAGl made it possible to identify 
PLAGl-related genes. The possibility that the human genome 
3 5 contains such genes was raised by Southern blot data showing 
bands (weak hybridization signals). In this example, the 
identification of another member of the PLAG gene family 
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that might be of pathogenetical relevance is described, i.e. 
the PLAG2 gene. 

To isolate cDNA clones corresponding to other members 
of the PIAG1 gene family, a cDNA library in AZAP was used 
5 (Stratagene) . As molecular probe, the complete human PliAGl 
cDNA was used. Screening of the cDNA library was performed 
according to routine procedures using low stringency hybrid- 
ization conditions ( Schoenmaker s et al. (1994)). This 
resulted in the isolation of a number of overlapping cDNA 

10 clones presumably corresponding to another member of the 

PLAG family. From the inserts of these, a 2.7 kbp composite 
cDNA could be constructed that contained an open reading 
frame (1233 nucleotides) for a protein (411 amino acids) 
structurally similar to PLAGl . This protein was designated 

15 PLAG 2 . The 2.7 kbp human PLAG 2 cDNA fragment started 17 6 
nucleotides upstream of the ATG start codon. 

The nucleotide sequence of the composite PLAG2 cDNA and 
the deduced amino acid sequence of the corresponding PLAG 2 
protein are shown in Figure 8. Nucleotide sequences were 

20 determined according to the dideoxy chain termination method 
using the T7 polymerase sequencing kit of Pharmacia/LKB. DNA 
fragments, subcloned in the pBLUESCRIPT or pGEM-3Zf ( + ) 
vector, were sequenced using standard primers and primers 
synthesized based upon newly obtained sequences. Nucleotide 

25 sequence data were obtained from both strands and analyzed 
using the sequence analysis computer programs Genepro (Ri- 
verside Scientific) , PC/Gene and Intelligenetics (IntelliGe- 
netics, Inc. ) . 

To established that the isolated PLAG2 sequences cor- 

30 responded to expressed sequences, Northern blot analysis was 
performed according to routine procedures. Total RNA was 
extracted from various human tissues and cell lines using 
Trizol™ (GIBCO/BRL) . After precipitation, concentrations of 
RNA in the samples were estimated via spectroscopic determi- 

35 nation. 40 /zg of total RNA from each sample was run on 1% 
agarose gels containing 1 x MOPS (0.02 M MOPS (Sigma) pH 
7.0, 50 mM Na-acetate, 10 mM EDTA pH 8.0), and 0.66 M for- 
maldehyde (Sigma) in a 1 x MOPS buffer for 16 h at 4 V/cm. 
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After gel electrophoresis, RNA was transferred to Hybond N* 
filters (Amershaia) using 10 x SSPE (20 x SSPE: 3.0 M NaCI, 
0.16 M NaH 2 P0 4 pH 7.4, 0.02 M EDTA) for standard capillary 
blotting. After completion of the transfers, nylon filters 
5 were irradiated with ultraviolet light using a Stratalinker 
(Stratagene) . Prehybridizations of the blots were carried 
out for 3 h at 42 °C in hybridization buffer consisting of 5 
x SSPE, 10 x Denhardt's, 2% SDS, 50% formamide and 100 /ig/ml 
herring sperm DNA. For hybridizations, blots were incubated 

10 overnight in the same buffer as described for the prehybrid- 
izations. Subsequently, blots were washed for 40 min at room 
temperature in 2 x SSC containing 0.05% SDS and for 40 min 
in 0.1 x SSC containing 0.1% SDS at 50 °C. Autoradiographs 
were obtained by exposing Kodak X-AR film at -80 °C with 

15 intensifying screen. 

In Northern blot analysis using the complete 2,7 kb 
PLAG2 cDNA as molecular probe and rather stringent hybrid- 
ization conditions, four distinct mRNAs were detected. These 
varied in size and included mRNAs of about 4.4 kb, 3.1 kb, 

20 2.9 kb, and 1.9 kb. The 3.1 and 2.9 kb transcripts were 
preferentially seen in fetal tissue, whereas the other 
transcripts were seen preferentially in adult tissues. These 
results established the expression of the PLAG2 sequences in 
a variety of cell types. Furthermore, the detection of 

25 various mRNAs with the PLAG2 cDNA probe suggests the exis- 
tence of various isoforms and raises the possibility that 
various PLAG2 protein isoforms may be produced. 

EXAMPLE 4 

3 0 Detection of hybrid PIAG1 in salivary gland tumor cells. 

cDNA clones of the chromosome 3 -derived CTNNB1 gene 
were isolated and the nucleotide sequence thereof establis- 
hed. The nucleotide sequence data of a composite cDNA are 
shown in Fig. 9. The amino acid sequence of the CTNNB1- 
3 5 encoded protein was deduced. 

In 5'-RACE analysis of primary pleomorphic adenomas of 
the salivary glands, PLAG1 
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containing fusion trancripts were identified. S'-RACE expe- 
riments were performed using total RNA of the primary tu- 
mors. A primer specific for exon 5 (MV2) was used in the 
cDNA synthesis. Following ligation of an adaptor to the 
5 cDNA, PCR amplification was performed using, in the first 
round, an adaptor-specific primer and a PLAGl-specif ic 
primer corresponding to sequences of exon 5 (MV5) and, in a 
second round, nested adaptor-specific primer and PLAGl- 
specif ic primer corresponding to sequences of exon 4 (MV6 ) . 

10 Nucleotide sequence analysis of the PCR product revealed 

that the ectopic sequences were fused to the acceptor splice 
site of exon 3 of PLAG1 . BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNB1 . the gene for 
B-catenin, which has previously been assigned to chromosome 

15 3p21. 



EXAMPLE 5 

Diagnostic test for pleomorphic adenoma of the salivary 
glands 

20 Fine needle biopsies of patients having a pleomorphic 

adenoma of the salivkry gland (with a chromosome 8ql2 
aberration others with a normal karyotype) were taken. From 
the material thus obtained total RNA was extracted using the 
standard TRIZOL™ LS protocol from GIBCO BRL as described in 

25 the manual of the /manufacturer . This total RNA was used to 
prepare the first/ strand of cDNA using reverse transcriptase 
(GIBCO/BRL) and In oligo dT(17) primer containing an 
attached short additional nucleotide stretch. The sequence 
of the primer used is as described in Example 2 , under point 

30 2.6 [AAG GAT COG TCG ACA TC(T)17]. RNase H was subsequently 

y used to remove/ the RNA from the synthesized DNA/RNA hybrid 
molecule. PCR /Was performed using a gene-specific primer 
(Example 2, point 2.6) and a primer complementary to the 
attached short additional nucleotide stretch. The thus 

35 obtained PCR /product was analysed by gel electrophoresis. 
Fusion constructs were detected by comparing them with the 
background bands of normal cells of the same individual. 
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In an additional/experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 
^7 primer complementary to the short nucleotide stretch [AAG 
a / GAT CCG TCG ACA/TC (T) 17 ] . The sensitivity of the test was 
^(5 thus signif ica/vtly improved. 



EXAMPLE 6 

Diagnostic test for uterine leiomyoma. 

10 Uterine leiomyoma is the most common pelvic tumor in 

women, occurring with an incidence up to 77% of women of 
reproductive age when tumors are counted after 2 millimeter 
serial sectioning of consecutive hysterectomy specimens. 
Often multiple leiomyomas are present, with estimates as 

15 high as an average of 6.5 tumors per uterus. Although most 
patients with these steroid-dependent tumors are asymptoma- 
tic, symptomatic leiomyomas can be associated with abnormal 
uterine bleeding, pelvic pain, urinary dysfunction, sponta- 
neous abortions, premature delivery and infertility. The 

20 high incidence of this benign smooth muscle tumor constitu- 
tes a major public health problem with the diagnosis of 
myomatous uterus leading to over 200,000 hysterectomies 
performed annually in the United States. Furthermore, non — 
random inactivation of the X chromosome has clearly demon- 

25 strated that uterine leiomyomas are monoclonal proliferati- 
ons. Although leiomyomata typically are benign tumors, in a 
very low percentage (< 0.1%) they might be able to metasta- 
size or even progress to malignancy. 

Besides a normal karyotype which is being found in 

3 0 approximately 7 0% of the cases investigated, several cytoge- 
netically abnormal subgroups, can be distinguished among 
leiomyomata. Leaving out of consideration the group which 
shows random changes, one of the largest cytogenetic sub- 
groups (comprising approximately 25% of the cytogenetically 

3 5 abnormal tumors) was first described in 1988 and is charac- 
terized by the involvement of 12ql5 and/or 14q23-24, mainly 
as t(12;14) (ql4-15 ;q23-24 ) . Another subgroup, with a similar 
incidence, contains deletions involving the long arm of 



# 
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chromosome 7, with region q21-22 being the most probable 
commonly involved region. Another subset of leiomyomas is 
characterized by numerical aberrations, mainly trisomy 12 . 
This trisomy is found in approximately 10% of the cytogene- 
5 tically abnormal leiomyomas. Furthermore, chromosome 6p21 — 
pter, has been found to be recurrently involved in roughly 
5% of the cases studied* Finally, a small percentage (ap- 
prox. 3.5%) of leiomyomata shows t (1;2) (p36;p24) . In uterine 
leiomyomata with chromosome 12ql4-15 aberrations, the HMGIC 

10 gene is affected and in those with chromosome 6p aberrati- 
ons, the HMGIY gene is affected. 

The observation that 70% of the uterine leiomyomata 
tested show a normal karyotype prompted studies to investi- 
gate whether activation of a PIAG gene could be found in 

15 uterine tumors. Therefore, Northern blot and PCR analysis 
were performed. 

Specimens of tumor material surgically removed from 
patients and classified by routine pathology as uterine 
leiomyoma, uterine leiomyosarcoma, f ibroleiomyoma, or uteri- 

20 ne fibroma were used. From the material thus obtained, total 
RNA was extracted using the standard TRIZOL™ LS protocol 
from GIBCO BRL as described in the manual of the manufactu- 
rer. This total RNA was used in Northern blot analysis, 
which revealed activation of PIAG1 in about 50% of the 

25 tumors tested, indicating that specific PIAG activation is 
also observed in these types of tumors. These results were 
confirmed by PCR analysis. These results reveal the possibi- 
lities of using PLAG1 as a diagnostic marker on routine 
biopsies taken from patients. Since PLAG1 is implicated in 

3 0 this type of tumor, PLAG1 can be used as a target for the 
development of novel therapeutic strategies for uterine 
leiomyoma . 

EXAMPLE 7 

3 5 Diagnosis of lipoblastoma. 

Lipoblastomas are benign pediatric tumors that are 
assumed to result from proliferation of primitive adipocy- 
tes. These type of tumors often grow rapidly but there are 
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no reports that they metastasize. Because of the rapid 
growth and their histopathological features, lipoblastomas 
can mimic myxoid or well-differentiated liposarcoma which 
makes the diagnosis problematic. As a direct consequence, 
5 therefore, some of these tumors have been treated in an 
unnecessarily agressive manner. 

A recurrent rearrangement involving the long arm of 
chromosome 8 has been reported for lipoblastomas. Southern 
blot analysis detected a rearrangement in the PLAG1 gene 
10 opening the possibility of using PLAG1 as a diagnostic 
marker . 

A biopsy of a patient having a lipoblastoma was taken. 
From the material thus obtained total RNA was extracted 
using the standard TRIZOL™ LS protocol from GIBCO BRL as 
15 described in the manual of the manufacturer. This total RNA 
was used to prepare the first strand of cDNA using reverse 
transcriptase (GIBCO/ BRL) and an oligo dT(17) primer contai- 
ning an attached short additional nucleotide stretch. The 
sequence of the primer used is as described in Example 2, 

2 0 under point 2.6. RNase H was subsequently used to remove the 

RNA from the synthesized DNA/RNA hybrid molecule. PCR was 
performed using a gene-specific primer (Example 2, point 
2.6) and a primer complementary to the attached short addi- 
tional nucleotide stretch. The thus obtained PCR product was 
25 analysed by gel electrophoresis. Fusion constructs were 
detected by comparing them with the background bands of 
normal cells of the same individual. 

In an additional experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 

3 0 primer complementary to the short nucleotide stretch. The 

sensitivity of the test was thus significantly improved. 

EXAMPLE 8 

PLAG2 as diagnostic marker. 
35 To evaluate the diagnostic potential of the PIAG2 gene, 

we have first identified on which human chromosome the PLAG2 
gene is locat d and subsequently established its subchromo- 
somal location. Such mapping studies could link the gene to 
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known tumor-specific chromosome anomalies. Using several 
different PLAG2 containing YACs and cosmids in FISH analy- 
sis, the PIAG2 gene was tentatively mapped to chromosome 
band 6g24, a chromosome segment implicated in various tumors 
5 such as for instance malignant salivary gland tumors. 

Malignant salivary gland tumors are a heterogeneous 
group of tumors comprising at least 17 different entities 
according to the most recent WHO classification (1991) . 
Because of the large number of tumor types and the wide 

10 morphological spectrum that these tumors show they constitu- 
te a well recognized diagnostic problem. The most frequent 
chromosome abnormalities so far observed among malignant 
salivary gland tumors are deletions or translocations invol- 
ving 6q. Most of the deletions so far encountered are 

15 terminal deletions with breakpoints located at 6q22-q25. 

However, there are also a few cases on record with intersti- 
al 6q deletions. The minimal common region lost in the 
majority of all cases is 6q24-qter. The 6q deletions are not 
restricted to certain types of malignant salivary gland 

20 tumors, but seem to occur in all major histologic subtypes, 
including adenoid cystic carcinoma, adenocarcinoma, undiffe- 
rentiated carcinoma, mucoepidermoid carcinoma and acinic 
cell carcinoma. The frequency of 6q rearrangements vary 
somewhat in different tumor types. In for example adenoid 

2 5 cystic carcinomas clonal 6q deletions or translocations have 

been found in nearly 50 % of the cases. The fact that the 6q 
deletions are often the sole karyotypic change, indicate 
that they are primary cytogenetic events of pathogenetic 
importance to malignant salivary gland tumors. In addition 
30 to 6q deletions a subgroup of adenoid cystic carcinomas are 
characterized by a t (6;9) (q21-24;pl3-23) . This is a primary 
abnormality that is diagnostic for this tumor types. It 
should also be noted that in addition to malignant salivary 
gland tumors there are several other tumor types with recur- 

3 5 rent deletions of 6q, including e.g. various types of leuke- 

mias and lymphomas, malignant melanomas and neuroblastomas. 



EXAMPLE 9 



WO 98/07748 



61 



PCT/EP97/04759 



Involvement of PLAG2 in malignant salivary gland tumors 

This example is performed to test whether the different 
6q deletions commonly seen in different histologic subtypes 
of malignant salivary gland tumors involve the PLAG2 gene at 
5 6q24. 

8 . 1 Material and methods 

FISH analysis using the PIAG2 containing CEPH mega- YACs 
798D8 and 921C2 was performed on metaphase chromosomes from 
10 two cases of malignant salivary gland tumors, one 

adenocarcinoma and one adenoid cystic carcinoma, with 6q 
deletions* The methods for cytogenetic analysis and FISH 
were the same as described for PLAG1. 



15 8.2 Results 

Hybridization with both YACs on metaphases from the 
adenocarcinoma resulted in signals of the same intensity on 
both chromosome 6 homologs at the expected position. There 
was no evidence of deletion of PLAG2 signals in this case. 

20 In contrast, the adenoid cystic carcinoma contained a 

subpopulation of cells, which in addition to the signal from 
the normal chromosome 6 showed either no signal or a weak 
signal from the 6q-marker chromosome. Both YACs gave a 
similar hybridization pattern. 

25 

8 . 3 Conclusion 

Preliminary studies suggest that at least one of the 
breakpoints in the 6q deletions in malignant salivary gland 
tumors map in the vicinity of the PIiAG2 locus. PLAG2 is 
3 0 therefore a candidate gene for the commonly ocurring 6q 
deletions in malignant salivary gland tumors. 

EXAMPLE 10 

Animal tumor models involving PLAG1 as tools in in vivo 
35 therapeutic drug testing 

On the basis of the acquired PLAG1 knowledge, animal 
tumor models can be developed as tools for in vivo therapeu- 
tic drug testing. To achieve this objective (for instance 
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for pleomorphic adenoma of the salivary gland) , two approa- 
ches can be used, gene transfer (generation of transgenic 
animals) on the one hand and gene targeting technology 
(mimicking in vivo of a specific genetic aberration via 
5 homologous recombination in embryonic stem cells (ES cells) ) 
on the other. These technologies allow manipulation of the 
genetic constitution of complex living systems in specific 
and pre-designed ways. For extensive technical details, see 
B. Hogan, R. Beddington, F. Constantini, and E. Lacy; In: 
10 Manipulating the mouse embryo, a laboratory manual. Cold 
Spring Harbor Press, 1994; ISBN 0-87969-384-3. 

To aim at the mutation of the PIAGl gene, specifically 
in selected cell types and selected moments in time, the 
recently described Cre/LoxP system can be used (Gu, H. et 



15 al . Deletion of a DNA polymerase 5 gene segment in T cells 
using cell type-specific gene targeting. Science 265, 103- 
106, 1994) . The Cre enzyme is a recombinase from bacteriop- 
hage PI whose physiological role is to separate phage geno- 
mes that become joined to one another during infection. To 

20 achieve so, Cre lines up short sequences of phage DNA, 

called loxP sites and removes the DNA between them, leaving 
one loxP site behind. This system has now been shown to be 
effective in mammalian cells in excising at high efficiency 
chromosomal DNA. Tissue-specific inactivation or mutation of 

25 a gene using this system can be obtained via tissue-specific 
expression of the Cre enzyme. 

As an example, the development of animal model systems 
for pleomorphic adenoma of the salivary glands using a 
member of the PLAG gene family will be outlined below, such 

30 that the models will be instrumental in in vivo testing of 
therapeutic drugs. 

Two approaches will be followed: 

a) in vivo induction of specific genetic aberrations as 
observed in human patients ((conditional) gene (isogenic) 

3 5 targeting approach) ; and 

b) introduction of DNA constructs representative for the 
genetic aberrations observed in patients (gene transfer 
approach) . 
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DNA constructs to be used in gene transfer will be 
generated on the basis of observations made in patients 
suffering from pleomorphic adenomas of th salivary glands 
as far as structure and expression control are concerned; 
5 e.g. PIAG1 fusion genes with various translocation partner 
genes, especially the preferential translocation partner 
gene CTNNB1 of chromosome 3, i.e. complete PIAG1 under 
control of a strong (salvary gland-specific) promoter. 

10 EXAMPLE 11 

The preparation of antibodies against PLAG-encoded proteins 

One type of suitable molecules for use in diagnosis and 
therapy are antibodies directed against the PLAG genes. Two 
approaches have been followed to develop them. Based upon 

15 the nucleotide sequence of the PLAG cDNAs, computer analysis 
was used to predict the location of antigenic determinants 
within the proteins. Synthetic peptides containing such 
determinants were prepared and rabbits were immunized with 
these. As an alternative approach, cDNA sequences can be 

2 0 expressed in appropriate pro- and eukaryotic expression 

systems and the polypeptides synthesized in this way can be 
purified and used for immunization of mice. Cell lines 
obtained upon transfection or electroporation of constructs 
of the PLAG genes were helpful in characterizing the antibo- 

25 dies obtained. 

Pparation of rabbit polyclonal antibodies 
the PLAGl-encoded proteins, use was made of 
ree commercially available peptides: 
GKR) 8-Multiple Antigen Peptide 
EQPL) 8 -MAP 
8 -MAP 
Research Genetics Inc., Huntsville, AL, USA. 
ntibodies were made according to standard 

35 With respect to PLAG2 . suitable antibodies were genera- 

ted using hybrid proteins as antigens. These contained 
various portions of the PLAG 2 coding region fused in frame 
to GST (glutathion-S-transferase) . 
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EXAMPLE 12 

Inhibition of PLAGl expression in tumor cells using antisen- 
se PIAG1 constructs. 

5 l*. Introduction 

PIAGl can be used as target in novel therapeutic proto- 
cols for tumors in which it is implicated; e.g. pleomorphic 
adenomas of the salivary glands, uterine leiomyoma , etc. 
Cancer as a genetic disease is a logical target for gene 

10 therapy, either by replacing the missing or inactivated gene 
or by suppressing the activity of an unwanted gene. The high 
affinity of short DNA sequences for their target mRNA has 
indicated that antisense oligonucleotides constitute valua- 
ble reagents to specifically suppress the production of gene 

15 products, both in vitro and in vivo . Applications have been 
documented in various biomedical research areas, such as for 
instance cancer research, virology, and cardiovascular 
research. The in vitro activity, toxicity, and pharmacok- 
inetic data of antisense oligonucleotides are encouraging 

20 and in vivo animal experiments demonstrating suppression of 
neointimal formation seem very promising. Also in the field 
of cancer research promising results have been obtained. 
However, potential nonspecific effects of antisense oligonu- 
cleotides should be carefully considered when antisense 

2 5 agents are used to define biological functions of specific 

genes • 

Since expression of the PLAGl gene is frequently acti- 
vated or strongly elevated in a wide variety of tumors and 
tumor cell lines, derived from tumors (rhabdomyosarcoma, 

3 0 uterine leiomyosarcoma, uterine leiomyoma, malignant 

salivary gland tumors) as well as leukemias and lymphomas, 
it was speculated that the PLAGl -encoded protein might play 
a key role in transformation of cells. This example shows 
that expression of the PLAGl gene can be strongly reduced by 
3 5 expressing antisense PIAGl sequences and that reduction of 
PLAGl levels in tumor cells results in reversion of the 
transformed phenotype. Thus the expression or administration 
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of antisense molecules can be successfully applied 
therapeutically . 

Materials and methods 
5 2.1 Tumor cell lines 

Tumor cell lines were generated from primary tumors as 
described by Kazmierczak et al. r Genes Chromosom. Cancer 
5:35-39, 1992. 



10 2.2 Culturing tumor cells 

Tumor cells were propagated in TC199 culture medium 
with Earle's salts, supplemented with 20% fetal bovine serum 
(GIBCO) , 200 IU/ml penicillin, and 200 microgram/ml strepto- 
mycin. 

15 

2.3 Transfection assay 

Transf ections were performed using various protocols, 
namely: 

1. The calcium phosphate precipitation procedure of Graham 

2 0 and Van der Eb ("A new technique for the assay of the infec- 

tivity of human adenovirus." Virology 52: 456-467, 1973.). 

2. Lipofection: Transf ections were carried out using liposo- 
me-mediated DNA transfer (lipof ectamine, GIBCO/BRL) accor- 
ding to the guidelines of the manufacturer. 

25 

2 . 4 Antisense constructs 

Sense and antisense constructs of the PLAG1 gene were 
obtained by inserting human PLAG1 cDNA sequences in both the 
sense and antisense orientation in expression vectors under 

3 0 transcriptional control of various promoter contexts, e.g. 

the long terminal repeat of Moloney murine leukemia virus, a 
CMV promoter, or the early promoter of SV4 0. For example, 
the CMV/ PLAG1 plasmid was constructed by cloning a human 
PLAG1 cDNA fragment containing all coding sequences of human 
3 5 PLAG1 in pRC/CMV (Invitrogen) allowing expression under 
control of the human cytomegalovirus early promoter and 
enhancer, and selection for G418 resistance. 
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3 . Results 

3.1 Inhibition of PIAG1 expression 

Inhibition of PLAG1 expression was observed in tumor 
cells after induction of antisense PLAG1 expression in these 
5 tumor cells. Immunoprecipitation and Western blot analysis 
indicated a strong reduction of PLAG1 protein levels in the 
cells expressing antisense PLAG1 sequences. Therefore, this 
approach can be used therapeutically in tumors with involve- 
ment with PLAGl. 
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